Gene Francci3_3937 details


Gene Information

Locus tag: Francci3_3937
Symbol:
ID: 3906896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4712863
End bp: 4715289
Gene Length: 2427 bp
Protein Length: 808 aa
Translation table: 11
GC content: 72%
IMG OID: 637881264
Product: membrane protein
Protein accession: YP_483016
Protein GI: 86742616
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
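
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
length = gene_length_bp(4712863, 4715289)
print(length, protein_length_aa(length))  # 2427 808
```

Under these assumptions the computed values agree with the listed Gene Length (2427 bp) and Protein Length (808 aa).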


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.183856
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.16689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACA TCGACGTGGA CCTGAGAGCC ACGGGTGGGG AGGACGACTC CCCACCCGAG 
GGTTCCCCTG ACGAGACCCG GTACGTGCCA CGATCCCAGC GTCCCGCGCG GGCGACCGGG
CAGCGTCCCG GAGAGCGGAC CGAACGTCTC GGCGAAGGGT TCGAACGCCG TGGCGGCCGC
CGACCCGGCG AGGTTTTCGA GGTGTCCACG GGGCCGATCC GCCCGCAGCC CGCTCCGGCG
CCGGCGCCGC GGCCGGAATC CGGGGCGGGC GAGCAGACCA TCAAGGTCTC TGGACTGCCG
TTGATCCGCC GCCCGCCCCC GGGTACCGTG CGGACCGGCC CGGTGCGGCC CCACGGCGAT
CCGTCGGAGC AACGGCCCAC CGCTTCCCCG GGCGAGATCT CCGTGAGCCG GGTGCCAACG
GCCGGGAAGG CGGGAGACGG GAAGGCGGGA GAACCGTCGG GCGAGGCCGG GGGAAGACCG
CCAGTCGACC CGGTCCGCCC CGCGGGGCCG GTGTCCGGGC GGTCCGGGCC CGCGATCCAG
CCGCCCGACG TCGTCACCGC CACGATGGAC CTGAGCGCGC TGCAACGCCG GCTGCGGGCG
GAGAAGACCG AGCTCCTCCG GCGGGTCGAG GGGCAGACTC CCCGGCATCG CCGCTCGGAC
CGGCGGGCGC GTCGAACCGA CGAGACCGAT CCGGAGAGCA CACAGATCTT TCCGCGGCCG
CCCCCCGCCG GGCGTGGCGG CCGATTTCCC GGGGCGGGCA CCGCGCGGTC CCGCGGTCGA
CGAGGGCCCG CCGCGACGGG CGGCGGCGAG CGGGCCGCCG CGGCCGCGTC CTTCGGGCCG
GGGCCCGGTA ACGGGGCCGA CGGCCTGGTG GACGATCTGA CCGGCGACGA GGGTCCGGGC
CGCGAGGGTC CGGGCCGCGA GGGTCCGGGC CGCGAGGGTT CCGGCGGCCG TGGCGGAGCC
AGGTCGAAGG CGATCTGGAC GCTGGCGGAC CAGGCCGTCT CCAGTGCGAC GAACGCCGCG
GTCTCGTTCC TGATCGCCCA TCAGGTCAGC GACGTCGAGT ACGGCGCGTT CGGTATTGCG
TACACGTTGT TCTCCATCGT CATCGGCCTG GTCCGCGCCG GAAGCTGCAT GCCGCTGAGC
ATGTTCTACT CGGGTGCCAC TCGGAGCGAC TTCCGGGCGG CGGCCACCGC CACCACCGGC
TCGTCGTTCG TCTTCGGTGT GGCCGTGGGA ATCGCGTTCG TCGGTCCGGG TCTGCTCCTC
GGCGGCCCGG TCGGGTCGTC GTTGTCGGCG ATGGGCCTCG TGCTGCCCGG CCTGCTGCTG
CAGGATGCGT GGCGCTATGT CTTCTTCGCG ATGGGGAAGC CCTTCGGCGC CTTCGTCAAC
GACACCGTCT GGGCGGTCGT CCAGATTTTC GGGATCTTCC TGCTCATCCA CCGCGGTGTG
ACCGCATCCC CGCCGCTGCT GCTCGCCTGG GGGGCCTCGG CTCTCGTCGC GGCCCTGCTG
GGGATAGCCC AGGCCGGCTT CTGGCCGTCG CCCGGCGAGA CCCTTCGATG GCTCCGCAGG
AACAAGTCGA ACTCCGCCTA TCTGGCGGCG GAGTTCATCA CCGTCCAGGG CGCGATGCAG
ACCTCGCTGC TGGTGATCGG AGCGGTGGGT TCGCTGGCGA CGGTGGGCGC CCTGCAAGGT
GCGCGCACGC TGCTCGGCCC GACCACGGTG GTCGGGGTGG GGGTCGTGAG CTTCGCGTTG
CCGGAGTTCT CCAAGCGGAC CTCCATGACC CGCCACGCCC GGGAGCGCGC CGCCTACGCT
CTCTCGGCCC TCGTCCTCGC CATCGGCACG GCGTGGAGCC TGATCTTCTA CCTCCTTCCG
GAACGTTACG GCCAGGCGTT GTTGGGCGAC TCGTGGGACG GCGTCAGGAA CATCCTGGGG
CTGTCGATTC TGCACTATCT GGCCGCGTCG GTCCCGGTCG GACCGGCCTG CATGGTCTAC
GCACTCGGAA AAGCCAAGAT CACATTTCGG GTCAATGCGG TCTTTGCGCC GATGTTGTTT
GGTTTTCCTA TCATTGGGTT GCTTGTCGGG GAGGCGCGGG GTGCGGTCGT CGGCTATAAC
ATTGCTTTCT GGTCCATTGC GCCGGTGTGG TTTGTGCTGC TCCGTCGACT CGCTCGGGAG
CACGACGCGG AGCAGGCCGC GCTGCGGGCG GCGCGGGGCG GCTCCGGGCC TGACCCCGCC
CCGGCGGGGC CGGGAGGACG AGAGCTGCCG CGCCCGCGCC GGGCCGGGAT GAGCGACGCA
CGTCGGTCGA GACGGTCGAA CGCTGGCGAA TCTGACCATC GAATGGATCC CGTGGATGAT
GTCGAGGGTA TGGATGTCGC TCCGGAGCAC GGCCCTCGCC GGCCTGGTCG CGGAACCGGA
TCGCGGCGTG GTGGGCGGGG CACCTGA
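
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs light cleaning before any computation such as the GC content field. The following is a minimal sketch with hypothetical helper names; it takes GC content to be the fraction of G and C bases over the cleaned sequence, which is the usual definition but is not confirmed by the record. Only the first row is used here, so the result will not match the 72% reported for the record.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGACGGACA TCGACGTGGA CCTGAGAGCC ACGGGTGGGG AGGACGACTC CCCACCCGAG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the whole sequence block through `clean_sequence` first gives the per-gene figure.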
 
Protein sequence
MTDIDVDLRA TGGEDDSPPE GSPDETRYVP RSQRPARATG QRPGERTERL GEGFERRGGR 
RPGEVFEVST GPIRPQPAPA PAPRPESGAG EQTIKVSGLP LIRRPPPGTV RTGPVRPHGD
PSEQRPTASP GEISVSRVPT AGKAGDGKAG EPSGEAGGRP PVDPVRPAGP VSGRSGPAIQ
PPDVVTATMD LSALQRRLRA EKTELLRRVE GQTPRHRRSD RRARRTDETD PESTQIFPRP
PPAGRGGRFP GAGTARSRGR RGPAATGGGE RAAAAASFGP GPGNGADGLV DDLTGDEGPG
REGPGREGPG REGSGGRGGA RSKAIWTLAD QAVSSATNAA VSFLIAHQVS DVEYGAFGIA
YTLFSIVIGL VRAGSCMPLS MFYSGATRSD FRAAATATTG SSFVFGVAVG IAFVGPGLLL
GGPVGSSLSA MGLVLPGLLL QDAWRYVFFA MGKPFGAFVN DTVWAVVQIF GIFLLIHRGV
TASPPLLLAW GASALVAALL GIAQAGFWPS PGETLRWLRR NKSNSAYLAA EFITVQGAMQ
TSLLVIGAVG SLATVGALQG ARTLLGPTTV VGVGVVSFAL PEFSKRTSMT RHARERAAYA
LSALVLAIGT AWSLIFYLLP ERYGQALLGD SWDGVRNILG LSILHYLAAS VPVGPACMVY
ALGKAKITFR VNAVFAPMLF GFPIIGLLVG EARGAVVGYN IAFWSIAPVW FVLLRRLARE
HDAEQAALRA ARGGSGPDPA PAGPGGRELP RPRRAGMSDA RRSRRSNAGE SDHRMDPVDD
VEGMDVAPEH GPRRPGRGTG SRRGGRGT
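
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure: it builds the codon table from the standard TCAG-ordered amino acid string, whose codon assignments translation table 11 shares with the standard code (the two differ in permitted start codons). Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence, copied as displayed.
first_row = "ATGACGGACA TCGACGTGGA CCTGAGAGCC ACGGGTGGGG AGGACGACTC CCCACCCGAG"
print(translate(first_row))  # MTDIDVDLRATGGEDDSPPE
```

The output matches the first 20 residues of the protein sequence as listed.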