Gene Francci3_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3679 
Symbol 
ID3905363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4413069 
End bp4414793 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content64% 
IMG OID637881005 
Producttrehalose synthase-like 
Protein accessionYP_482760 
Protein GI86742360 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTG AGCCTCTCAG CGAACCGATC GGCGACCACA CCCCCGGCCC GGCCCCCGCC 
ATGCCGGTCG GGGGGACTTT ACGCGACCCA CACTGGTTCA AGCGAGCCGT GTTCTACGAG
GTGCTCATCC GCGGCTTCGC GGACTCCAAC GGCGACGGCA CGGGGGACAT TCGCGGCCTG
ATCTCCAGGC TCGACTACCT GGAGTGGCTC GGTGTCGACT GTCTGTGGCT GCTACCGATC
TACTCCTCGC CGTTGCGTGA CGGCGGCTAC GACATCAGTG ACTACTTTCA GATCCTGCCG
GAATTCGGTG ACCTCGGCGA CTTCGTTAGC CTGGTTGACG AGGCCCACCG CCGGGGCATC
CGGATCATCG CGGATCTGGT GATGAACCAC ACCTCGGACG CCCATCCCTG GTTCCAGGCG
TCCCGCTCCG ACCCCGACGG GCCGTTCGGG GACTTCTACG TCTGGTCCGA CAGCGACGAG
CTGTACCCGG ACGCCCGGAT CATCTTCGTG GACACCGAGA AGTCGAACTG GTCGTGGGAT
CCGGTCCGCG GTCAGTACTA CTGGCACCGT TTCTTCTCCC ACCAGCCCGA CCTGAACTAT
GACAACCCCG AGGTCCAGGA GGCGATGCTG GAGGTTCTGC GCTTCTGGCT CGATCTCGGC
ATCGACGGGT TCCGGCTCGA CGCGGTCCCC TACCTCTATG TCCGGGAGAA CACCAACGGC
GAGAACCTGC CGGAGACCCA CGAGTACCTC AAGCGGGTCC GCAAGGAGGT CGACGCCAAG
TACGCCGACC GGGTGCTGCT CGCCGAGGCA AACCAGTGGC CCTCCGACGT CGTCGAGTAC
TTCGGCAACG ACGACGAGTG CCACATGGCG TTCCACTTCC CGTTGATGCC GCGCATCTTC
ATGGCGGTCC GGCGGGAGTC CCGCTACCCA ATCTCGGAAA TTCTTGCTCA GACACCCCAG
ATCCCGCCGA ACTGCCAGTG GGGCATCTTC CTGCGCAACC ACGACGAGCT GACGCTCGAA
ATGGTGACCG ACGAGGAACG GGACTACATG TGGGCCGAAT ACGCGAAAGA TCCGCGTATG
AAAGCCAACA TCGGCATTCG TCGCCGGCTG GCGCCGCTGC TGGACAACAG CCGTGACCAG
ATGGAGCTGT TCACCGCGCT CCTGCTCTCG CTGCCCGGCT CGCCAGTGCT GTACTACGGC
GACGAGATCG GGATGGGCGA CAACATCTAC CTGGGCGACC GGGACAGCGT CCGGACTCCG
ATGCAGTGGT CACCGGATCG TAACGCCGGT TTCTCCACCG CGGACCCGGC CCGACTGTAC
CTTCCCCTGA TCATGGATCC GGTCTACGGT TATCAGGCCC TGAACGTCGA GGCCGGACAG
CGCATGCCGA CCTCGTTCCT GGCCTGGACC AAGCGGATGA TCGAGGTACG CAAGCGTCAT
CCGGTCTTCG GGCTCGGCGA CTACACCGAG CTCGGCGCGT CCAATCCCTC GATCTTCGCC
TTCGTCCGCG AGTTCGGCGA CGACCGGGTC CTGTGCGTCG CGAATCTGTC CCGCTTCGCC
CAGCCGGTGG AGCTCGATCT GCGTCGTTTC GAGGGAATGG TTCCGGTGGA GCTGCTCGGC
CGGGTGCACT TCCCGCCGAT CGGCGAGCTT CCCTACCTGC TGACACTGCC AGGTCACGGA
CATTACTGGT TCGCGGTGAC CAGACCGGGG GAATTCACCC CCTAG
 
Protein sequence
MDSEPLSEPI GDHTPGPAPA MPVGGTLRDP HWFKRAVFYE VLIRGFADSN GDGTGDIRGL 
ISRLDYLEWL GVDCLWLLPI YSSPLRDGGY DISDYFQILP EFGDLGDFVS LVDEAHRRGI
RIIADLVMNH TSDAHPWFQA SRSDPDGPFG DFYVWSDSDE LYPDARIIFV DTEKSNWSWD
PVRGQYYWHR FFSHQPDLNY DNPEVQEAML EVLRFWLDLG IDGFRLDAVP YLYVRENTNG
ENLPETHEYL KRVRKEVDAK YADRVLLAEA NQWPSDVVEY FGNDDECHMA FHFPLMPRIF
MAVRRESRYP ISEILAQTPQ IPPNCQWGIF LRNHDELTLE MVTDEERDYM WAEYAKDPRM
KANIGIRRRL APLLDNSRDQ MELFTALLLS LPGSPVLYYG DEIGMGDNIY LGDRDSVRTP
MQWSPDRNAG FSTADPARLY LPLIMDPVYG YQALNVEAGQ RMPTSFLAWT KRMIEVRKRH
PVFGLGDYTE LGASNPSIFA FVREFGDDRV LCVANLSRFA QPVELDLRRF EGMVPVELLG
RVHFPPIGEL PYLLTLPGHG HYWFAVTRPG EFTP