Gene Francci3_1693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1693 
Symbol 
ID3903270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2030717 
End bp2032606 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content72% 
IMG OID637879031 
ProductType IV secretory pathway VirB4 components-like 
Protein accessionYP_480798 
Protein GI86740398 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000521447 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.17897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAC GAGCCCGACG CCGGACCTCC GCGCAGGCCC CCAGCCCGTC GACCCGGACA 
GTGGACGCTG CCGCGGCGGC GTTCGTCCCG GACGCACTCA CGATCGCGCC CCGCCACCTG
GACGTCGGCG GGGATTACGT GGCCACGATG GCGATCACCG GCTATCCGCG TGAGGTCCAT
GCCGGCTGGC TCGCCCCGCT GGTGACCTAC CCGGGCCGGG TCGACGTCGC CGTGCACGTC
GAGCCGATTG ACCCGGTCAC CGCGGCGAAC CGGCTGCGCC GGCAGCTGTC GAAGCTGGAG
TCCGGCCGGC AGCTCGGCGA CGAGAAGGGC CGGCTGGTCG ACCCGCAGGT CGAGGCGGCG
ACCGAGGACG CCTACGACCT GTCCGCCCGC GTAGCCCGCG GCGAAGGCAA GCTGTTCAGG
CTGGGTCTGT ATTTCACCGT CCATGCGGCC AGCGAAGCCG AGTTGGCCGA CGAGGTCGCC
GCCGTGCGGG CGCTGGCGGC CAGCCTGCTG CTGGACGCCA AGCCAGTCAG CTACCGCTCG
CTCCAGGGCT GGGTCAGCAC CCTGCCCCTC GGCTTGGACC AGGTGCGGAT GCGCCGCACC
TTCGACACCG CAGCCCTGTC CGCGGCGTTC CCGTTCACGT CGCCCGATCT GCCGCCCGCC
GACCCGACCT CTCTGGCTCC GACCGGGGTG CTCTACGGGC TCAACGTCGC GAGCAACGGG
CTGGTCCACT GGGACCGGTT CGGCGACGTC GACAACCACA ACGCCGTCAT CCTCGGCCGC
AGCGGCGCCG GCAAGTCCTA CCTGGTCAAG CTCGAACTCC TGCGCAGTCT CTACCGGGGC
ATCGAGGTCC ACGTCGTCGA CCCGGAAGAC GAATACGCCC GGCTCGCCGC CGCGGTCGGC
GCCAGCTACC TGCACCTCGG CGCCGACGAG GTGCGGATCA ACCCGTTCGA CCTGCCGATC
CAGACCACCC CCGACGGGCG GCGCACCGCA CCGCGCGACG CGCTGGTGCG GCGCAGCCTG
TTCCTGCACA CCGTTATCGC CGTCCTGGTC GGCCAGCTGA GTGCGGCCGA ACGGGCAGCC
CTCGACGTCG CGATCACCGC CACCTACCAG GCCGCCGGGA TCAGCTCCGA CCCGCGCAGC
TGGAACCGGC CGGCACCGCT GCTGGCCGAC CTCGCCACCA CCCTGGCCAG CTCCAACGAC
CCGGCCGCGG TCGCGCTCGG CGCCCGGCTG CACCCGTTCA CCGCCGGGGC GTTCTCCGGC
CTGTTCAACG GGCCGACGAC CCGCCGCGGC GACGGCCACC TTGTCGTCTA CTCGCTGCGC
GACCTTGCGG ACGAGTTGAA GCCGATCGGG ACGCTACTCG TCCTCGACGC CGTGTGGCGG
CGAGTCTCCA ACCCCGCCGA CCGCCGTCCC CGCTTGGTCG TAGTCGACGA GGCATGGCTG
CTCATGCGCC AGCCCGCTGG CGCGGACTTC CTGTTCCGCA TGGCCAAGTC GTCCCGCAAG
CACTGGGCCG GGCTCACCGT GGCCACCCAG GACACCGCCG ACGTGCTCGC CACCGACCTC
GGCAAAGCGA TCGTCACCAA CGCCGCCACC CAGATCCTGC TCCGCCAGGC ACCGCAGGCC
ATCGACGAGA TCACCGCCAT CTTCGACCTG TCCCAGGGCG AACGGCAGTT CCTGCTGTCC
GCCGACCGCG GACAAGGACT CCTCGCGGCG GGGGCACAAC GAGTCGCTTT CCAAGCCCTG
GCCTCGCAGG TCGAGCACCG CCTGGTCACG ACCAACCCAG CCGAACTCGC CGCCGACCCC
GACAACGCGG CCGACGACGG CTTCCTCGAT CTCGCCGTGC CGGACGACCC GACCGATGAC
AACGGCCAGA TCTACCTCGA TGCCGCCTGA
 
Protein sequence
MSRRARRRTS AQAPSPSTRT VDAAAAAFVP DALTIAPRHL DVGGDYVATM AITGYPREVH 
AGWLAPLVTY PGRVDVAVHV EPIDPVTAAN RLRRQLSKLE SGRQLGDEKG RLVDPQVEAA
TEDAYDLSAR VARGEGKLFR LGLYFTVHAA SEAELADEVA AVRALAASLL LDAKPVSYRS
LQGWVSTLPL GLDQVRMRRT FDTAALSAAF PFTSPDLPPA DPTSLAPTGV LYGLNVASNG
LVHWDRFGDV DNHNAVILGR SGAGKSYLVK LELLRSLYRG IEVHVVDPED EYARLAAAVG
ASYLHLGADE VRINPFDLPI QTTPDGRRTA PRDALVRRSL FLHTVIAVLV GQLSAAERAA
LDVAITATYQ AAGISSDPRS WNRPAPLLAD LATTLASSND PAAVALGARL HPFTAGAFSG
LFNGPTTRRG DGHLVVYSLR DLADELKPIG TLLVLDAVWR RVSNPADRRP RLVVVDEAWL
LMRQPAGADF LFRMAKSSRK HWAGLTVATQ DTADVLATDL GKAIVTNAAT QILLRQAPQA
IDEITAIFDL SQGERQFLLS ADRGQGLLAA GAQRVAFQAL ASQVEHRLVT TNPAELAADP
DNAADDGFLD LAVPDDPTDD NGQIYLDAA