Gene Cphy_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2139 
Symbol 
ID5744145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2646855 
End bp2647898 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content34% 
IMG OID641293234 
Productextracellular solute-binding protein 
Protein accessionYP_001559244 
Protein GI160880276 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA AAATAATAAT AACGGTTGTG CTGTCACTCT TTGTCATATT TTTAGTTGGA 
TATACCACAT CAATAAAAAA TCACAATTTA AAAACTAGTG AAGAAGAGCA GGAAGAGTTA
GTAATTTATA GTTCACATCC ATTGGATTTT CTAAAACCTT TAATCGAAGA ATTCGAGTCA
AGAACAGGGA TTTTTGTAAC AGTTGTCAGT GGCGGTACCG GACAATTGAT TGACAGAATT
GAAGAGGAAC AAGATAACCC GAATGCGGAT ATATTATGGG GCGGAACTGC GTCTATATTA
AAACCTCAGA TGTATCTGTT TGAAGAGTAC TCTTGTGCCA ATGAAGATGT AATTCAAAAA
GAATTTAAGA ATAAAGAAGG AGCGTTTACT AAGTTTTCTG ATGTACCCAG TGTCTTAATG
GTTAATACAG ATTTGATTGG AAATATAAAA ATTGATGGAT ACAAAGATTT ACTAAATCCG
GAGCTAAAAG GTAAGATAGC TTATTGTAGT CCTAATGTAT CATCATCTGC CTTCGAGCAT
CTTATTAACA TGTTATATGC TATGGGAGAC GGGAATCCTG AGGATGGTTG GAATTATGTA
AAACTCTTTT GTAATAATTT AGACGGGAAT CTATTATACA GTTCTACAGA TGTTTACCGT
GGAGTTGCAA ATGGTGAATT TGTTGTAGGG CTTATATTTG AAGAAGCTGC TGCCGCATTG
GTGGCGAATG GAGAACATAT TAAAATTACA TACATGGAAG AAGGAGTTTT ATCTACACCT
GATTGCGTTA CCATTGTTAA AAATTCGCCT CATCTTAAAA ATGCCAGGGC TTTCATTGAT
TTTGCTACCG GATATGAAGT ACAGACGATG ATAACAATGG AGCTTAATAG ACGATCGGTT
CGTGACGATG TAAAAACTCC AACGTACCTT AAAGCAAAGG ATGAGATTGC AATTATTCAT
GCAGATAATG AACTAATTTA TGAAATGAAA AAAGAATGGA TACGAAAATT TGAAGAAATA
TTTCTAGATA TCAAAGAAGA ATAG
 
Protein sequence
MKYKIIITVV LSLFVIFLVG YTTSIKNHNL KTSEEEQEEL VIYSSHPLDF LKPLIEEFES 
RTGIFVTVVS GGTGQLIDRI EEEQDNPNAD ILWGGTASIL KPQMYLFEEY SCANEDVIQK
EFKNKEGAFT KFSDVPSVLM VNTDLIGNIK IDGYKDLLNP ELKGKIAYCS PNVSSSAFEH
LINMLYAMGD GNPEDGWNYV KLFCNNLDGN LLYSSTDVYR GVANGEFVVG LIFEEAAAAL
VANGEHIKIT YMEEGVLSTP DCVTIVKNSP HLKNARAFID FATGYEVQTM ITMELNRRSV
RDDVKTPTYL KAKDEIAIIH ADNELIYEMK KEWIRKFEEI FLDIKEE