Gene Francci3_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1592 
Symbol 
ID3903727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1908918 
End bp1909982 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID637878929 
Productextracellular solute-binding protein 
Protein accessionYP_480697 
Protein GI86740297 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.641275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.651768 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGTCC AGGTGGAGGG TTCAGGTGGA GTGTTCAGCT GGAGGAGCCC GATGGGTTCG 
TTAGGAAGAA CCTTGGTCGT GCTGGTGACG GCCACCTGCC TGGCCGGCCT GTCAGCGTGC
GGCTCGGGGG ACGACGGGAA GACGATCACG CTCTACAACG CGCAGCACCA GGACCTGATG
CGGGTGATGG TGGACGCGTT CACCAAGCAG ACCGGCATCA AGGTCGAGTT GCGTCGCGGC
GGCGACCCCG AGCTGGCGAA CCAGATCGTC CAGGAAGGCG ACAGCTCGCC GGCGGACGTC
TTCGTCACCG AGAACTCGCC GGCCATGACG CTGGTCGACC GCGCCGGCCG CTTCAGCAAA
CTGGACCGGG CCACCTTGGG CCAGGTGCCT GACCAGTACG TCCCGAGCAC CGGCAACTGG
GTCGGTTTCG CGGCCCGGTC GACGGTGTTC ATCTACAACC GTGGGCAGGT CGCCAAGAAC
GAGCTGCCCA CGTCGATCAT GGACCTGGCG GGACCGGCGT GGAAGGGGAA GGTCGGTGTC
GCGGCGGCCG GAGCGGACTT CCAGGCCATC GTCAGCGCCG TACTCGCGGT GAAGGGCGAG
GGGGCCACCG CCGAGTGGCT CGCCGGGCTG AAACGCAATG CGAAGATCTA CGACAACAAC
ATCGCCGCGC TGCGCGCCGT GAACGCGGGC GAGGTCCCCG CCGCTGTGAT CTACCATTAC
TACTGGTACC AGGACCAGGC GGAGTCGGGC AAGGACAGCA GGAACGTCGA CCTGCACTTC
TTCGGCCACC GGGACCCGGG CGCGTTCGTC AGCGTCTCCG GCGCCGGCGT CCTCGCGGCC
AGCGACCAGC AGGCCGAGGC GCAGCGGCTG GTCGCCTTCC TCACCAGCGA CGCCGGGCAG
AAGGCGCTGG TCGACAGCGG TGCCCTGGAG TACGCCGTGT CCGACGCGGT CCCCACGAAC
CCTGCGCTGA AGCCGCTGTC GACCCTCGAT CCGCCCGACA TCGACATCTC GACCCTGAAC
GGACCAAAGG TCGTCGAACT GATGCAGCGG GCGGGCCTGC TCTGA
 
Protein sequence
MSVQVEGSGG VFSWRSPMGS LGRTLVVLVT ATCLAGLSAC GSGDDGKTIT LYNAQHQDLM 
RVMVDAFTKQ TGIKVELRRG GDPELANQIV QEGDSSPADV FVTENSPAMT LVDRAGRFSK
LDRATLGQVP DQYVPSTGNW VGFAARSTVF IYNRGQVAKN ELPTSIMDLA GPAWKGKVGV
AAAGADFQAI VSAVLAVKGE GATAEWLAGL KRNAKIYDNN IAALRAVNAG EVPAAVIYHY
YWYQDQAESG KDSRNVDLHF FGHRDPGAFV SVSGAGVLAA SDQQAEAQRL VAFLTSDAGQ
KALVDSGALE YAVSDAVPTN PALKPLSTLD PPDIDISTLN GPKVVELMQR AGLL