Gene Francci3_0464 details


Gene Information

Locus tag: Francci3_0464
Symbol:
ID: 3903195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 542550
End bp: 543908
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 71%
IMG OID: 637877795
Product: extracellular solute-binding protein
Protein accession: YP_479579
Protein GI: 86739179
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
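
The length fields above can be checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption about this page's convention, though it agrees with the numbers shown) and that the last codon of the unspliced CDS is a stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(542550, 543908)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Both results match the Gene Length and Protein Length fields reported for Francci3_0464.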


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.364922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.192606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCGTA TTCGCCGAAT ACTGCCGCTG CCCGTCGCCG CCACGATGAT CATCCTGACC 
GCGTGCGGCG GCGGTGGTGA CGCGAGCCCG GCAGACCCCG CGGAGAGCCT GCGCCCCACG
GCCCGTCCAG CCGACGCCGG CGTCGACAAC GTGGCGGGCG CCAAGGCGTC GCCCGCCTGC
GTGGCCCAGG TCAAGACGCT GCGGATGTCC GCCGTCGGCA CGCTCAACGA CGTCGCGAAA
TCCGGCAAGG CCTATCTGGA GAAGGCGCAT CCCGGCCTCA CCGTGGACCT CAACACGAGT
GCGCCGGACT ACACCTCCCT CGTCCAGCAG ATCAGCGCCG ACCGCTCGGC CGGCCGTTCC
GTCGACGTGG CGGTCGCCGG CTTCGACCTG CTCCCGACCT TCGCCCGGGA CCTCGGCGCC
CAGGAGCTCT CCCCACGCCT GCTGCGCGCG TCCTACGACC AACGGCTGGT CGGCCTCGGG
CAGGTCGCCG GGAAGCAGAT CGGCATTCCC CAGCAGGTGT CGTCTCTGGC CCTGGTCTAC
AACCTCGACG TGCTGCAGAA GGCCGGCGTC GATCCGGCGA CGCTGGGCAC GACCGACGGG
GTGATCGCCG CCGCCGACAA GATCAAGGCT TCGGGTCAGA ATATCCAGCC CCTCGACCTG
CCGACCGGCC AGCAGTTCGG GCAGTGGATC CTCAACACCC TGGCCAGCTC CAAGGGGACG
CCGATCCAGG ACGCGAACGG TCGGCCCGCC CTGAACACCC CGGCGGCCCG CGAGGCCGCC
GCGTTCCTCG CGAAGGCCGC GAGCTACGGC ACGCACTCCG CCGATCCGAC CCAGCAGGGC
CTGCTGCGGT TCGGCATCCG CCGGCAGACG GCGATGACCG CCGTGACGGT GGCCTCCGTG
GCCGGCGGGC TGAAGTTCAT CGCGGGGCAG GGGACGAAGG GTTTCCGGGC CGGCGCGGTC
CCGTTCCCGA CTCTGCCCGG CGGGACCCAG CACCCGGTCG CGGGCGGCAA CGCGCTGACG
GTCCTGTCCA CCGACCGCTG CCAGCGGGAG ATGGCGACCG AACTGGTCGT GTCGCTGCTT
TCGCCGGACG TCGTGGCCGC GAGCACGGAG GCGTTGAGCT ACATCCCGGT GGATACTCAG
GCCGTCAGCC AGCTCGGGTC GTTCTACGAG ACCTATCCGC AGCTCAAGCC GTTCAACGCG
CTCATCCCCT CGCTGGTGAA GGCGCCGGCT TGGAGCGGGG CCCGCGGCGG GGAGGTCCCG
AGCGCGATCT CGGACCAGGT GCAGCGCATC CTTAAGGGCG AGGACCCGGT CAGGGCCCTC
GCCGCGGCCC AGAGCCAGGC TGTGGAACTC ACCCGTTGA
 
Protein sequence
MIRIRRILPL PVAATMIILT ACGGGGDASP ADPAESLRPT ARPADAGVDN VAGAKASPAC 
VAQVKTLRMS AVGTLNDVAK SGKAYLEKAH PGLTVDLNTS APDYTSLVQQ ISADRSAGRS
VDVAVAGFDL LPTFARDLGA QELSPRLLRA SYDQRLVGLG QVAGKQIGIP QQVSSLALVY
NLDVLQKAGV DPATLGTTDG VIAAADKIKA SGQNIQPLDL PTGQQFGQWI LNTLASSKGT
PIQDANGRPA LNTPAAREAA AFLAKAASYG THSADPTQQG LLRFGIRRQT AMTAVTVASV
AGGLKFIAGQ GTKGFRAGAV PFPTLPGGTQ HPVAGGNALT VLSTDRCQRE MATELVVSLL
SPDVVAASTE ALSYIPVDTQ AVSQLGSFYE TYPQLKPFNA LIPSLVKAPA WSGARGGEVP
SAISDQVQRI LKGEDPVRAL AAAQSQAVEL TR
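
The relationship between the two sequences above can be illustrated with a short translation sketch. It hard-codes NCBI translation table 11, the table named in the Gene Information section, and reads an alternative initiator codon such as GTG as methionine. It is a minimal illustration, not the tool that produced this page; `clean`, `translate` and `gc_fraction` are hypothetical helper names.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated in NCBI's T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by table 11; an initiator codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed on this page.
print(translate("GTGATCCGTA TTCGCCGAAT ACTGCCGCTG"))  # MIRIRRILPL
# Last 9 bases: two codons followed by the TGA stop.
print(translate("ACCCGTTGA"))                          # TR
```

The first call reproduces the opening ten residues of the protein sequence, including the leading M from the GTG start codon, and the second reproduces its final two residues. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.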