Gene Francci3_2252 details


Gene Information

Locus tag: Francci3_2252
Symbol:
ID: 3905020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2629297
End bp: 2630706
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 70%
IMG OID: 637879583
Product: extracellular solute-binding protein
Protein accession: YP_481349
Protein GI: 86740949
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
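
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record states the gene is not spliced). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2629297
end_bp = 2630706
gene_length_bp = 1410
protein_length_aa = 469


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks pass for this record: the span from start to end covers 1410 bp, and 1410 bp corresponds to 470 codons, that is, 469 residues plus the stop codon.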


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00844632
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0063434
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACAGCCG TTTCCCGGCA CGACGATGCT GCTGGACATG AACGGGACGC AGGACAAGAA 
CGGGACGCAG GACAGGAACG CCCGGGCCGC CGTCCGCGGC GGCACCGGGC CCGGCTGGTC
GTCACGGCGT TGTCCGCCGC GGTCGGCCTG CTCGCCGTGG CGGCCTGCGG CTCGGACAGC
GACACCGCCT CCGGTACCCC GGCCGCGACA CCGCAAGGTG ACACCCCGGC GACCATCACC
TTCCTGTCCT ACAACTACGG CACCCCGGGT CTCGGTGGTA CGGGAACCCA GGCACTGCTC
GACGCCTTCG CCAAGGCGCA CCCGAAGATC ACGGTCAAGC CGCAGGGCGT CGCGGTGAAG
GACGTCCTCA CCCGGCTGCG CACCGACACC GCCGCCGGTG ATCCGCCCGA CGTCGCCCAG
ATCGGCTGGA GCAAGATGGC CGAGGCGGTC GACGCCCTGC CTATCACCCC GGTCCAGAAG
GTCGCCGGCA GCGAGTGGGA GTCGGCCACC GCCGGCATCT CGAAGAGCAT CCTGTCGGCC
GTCTCCACCA ACGGCGTCGT CGCGGCGATG CCGTTCACGA TGTCGATCCC GGTCATGTAC
TACAACGCCG ACCTGTTCCG CGCCGCCGGC CTGGACCCGC AACACCCGCC GACGACCCTC
GCCGACGTCA AGGCCGCCGC TTTGAAGATC AAGGCGACCG GTAAGCAGGG CGTCTACATC
AGCGTCGTCG ACAGCGGGAA GTCGGACTAC CTGACCCAGT CGGTCGTCAA CTCCAACGGC
GGCTCGCTGG TGGACAAGAA CGGCGGCGTC ACCCTCGACA AGCAGCCGGC CGTCGAGGCG
CTGGCCACGA TCGCCGACCT GACCGCCTCG GGTGCCCAGC CCGGAGTCAA GGCCGAAGCG
GCCCTGGCCG CGTTCACCAA GGGTGACCTC GGCATGTTCG TCACCAGCAC GGCGCTGCTC
GCCAGCGCCC AGAAGGCGGC GGCGGGCAAG TTCGAGCTGC GCACCGCGGG TCTGCCGTCC
TTCGGCACCA AACCCGCCCG CCCGACCTAC TCCGGCGCCG GGCTCGCGGT GCTGGCCAAG
GACCCGGCCA AGCAGCGCGC CGCCTGGGAG TTCATCAAGT TCCTCACCTC CGACGAGGGC
TTCGAGATCA TCACCTCGAA GATCGGTTAC CTGCCGCTGC GACAGAGCGT GGCGACGAAG
CTCGCCGGCA CCCCGATCGT GAAGCTGCTG GAACCGGCCC TCGACCAGCT CGACACCGTC
ACCCCCTACA CCTCGTTCCG CGGGGCGAAG GCCAACCAGG CCGTCGTCGT GCTGCAGGAC
GAGGCCGTCG AACCGATCGT CCTGCGCGGG GCCGATCCCC AGGCGACCCT GAGCAAGGCC
GCCGAGAAGA TCCGCGCACT CTCCTCCTGA
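
The gene sequence above is laid out in space-separated blocks of ten bases, so it must be joined before its length or GC content can be recomputed. The sketch below shows one way to do that, assuming the record's GC content is the fraction of G and C bases over the whole gene. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "GTGACAGCCG TTTCCCGGCA CGACGATGCT GCTGGACATG AACGGGACGC AGGACAAGAA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all rows of the gene sequence to `clean_sequence` instead of the first row alone gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.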
 
Protein sequence
MTAVSRHDDA AGHERDAGQE RDAGQERPGR RPRRHRARLV VTALSAAVGL LAVAACGSDS 
DTASGTPAAT PQGDTPATIT FLSYNYGTPG LGGTGTQALL DAFAKAHPKI TVKPQGVAVK
DVLTRLRTDT AAGDPPDVAQ IGWSKMAEAV DALPITPVQK VAGSEWESAT AGISKSILSA
VSTNGVVAAM PFTMSIPVMY YNADLFRAAG LDPQHPPTTL ADVKAAALKI KATGKQGVYI
SVVDSGKSDY LTQSVVNSNG GSLVDKNGGV TLDKQPAVEA LATIADLTAS GAQPGVKAEA
ALAAFTKGDL GMFVTSTALL ASAQKAAAGK FELRTAGLPS FGTKPARPTY SGAGLAVLAK
DPAKQRAAWE FIKFLTSDEG FEIITSKIGY LPLRQSVATK LAGTPIVKLL EPALDQLDTV
TPYTSFRGAK ANQAVVVLQD EAVEPIVLRG ADPQATLSKA AEKIRALSS