Gene Haur_2265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2265 
Symbol 
ID5734152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2897430 
End bp2898572 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content50% 
IMG OID641279406 
Productextracellular solute-binding protein 
Protein accessionYP_001545033 
Protein GI159898786 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCC ATTTCACGCG CATTGTGGCG CACGTGTTCA CTGTCGGACT CATGGCCTCG 
TTGATCAGTT GTGGGAGCAC GCAAGTAACG CCTACCGTCA GCCAAAGTGC CCAATCAGTT
TCAGCCTCAA ACGCCGATCC CGCTTTAGTT GAAGCCTCAA AAAAAGAAGC CGATGGTATT
TTGATTTATT CGATTATGAG TGAGAAAAAT TGGCAGCCAG TGATTCAAGG CTTTAACGCC
AAATATCCTT GGATCAAAGT AACAACTGCT GATTTAGGAG CTTACGAGGT ATTCGAACGC
TACTACACCG AAGCCGCTGG CAATGCCCGT ACTGCCGATT TTATCTCCAC CACCGCGCCT
GATGGCTGGA TCAACTTTAT CGATAAAGGC GAAGTCCAGT CGTATGTTTC CAGCGAAGAT
GCCCAAATTC CTGAGCTTGG CAAAATTGCT CAAGGGGTCT ATGCCGCCTC AACCGATCCG
ATGGTCTTTA TTTGGAACAA GCAATTGGTC GCTGATCCGC CGCAAACTAT GGCCGAACTG
ATCAGCATGA TCGAAAAAGA CCCCAGCGCC ATGCAAGACA AATTGGTGAG CTACGATGCC
GATGGCGAAG GCTTTGGCTA TGGTCTCAAC TGGTTTTATA CCAAGGCCAA AGGTGCTGAT
GGTTGGAAAA CTTTAGAAGC AATTGGTTCA AGCCAGCCCA AAATTTTAAC CTCTGGCGGC
AAGATGATCG ATTCAGTGCT TTCGGGCGAA TCCAGCATCG GCTACTTTGT CTCGAATATC
ACAGTTCAGC CACGCTTGGA AGCTGCTCAA CAATTGCTTG GCTATAGCTT TGTGCCTGAT
GCTCAAGTTG TCGCGGTGCG GGGCATGGCA GTCACCAAAC ATGCTGCTAG CCCTAATTCA
GCCAAATTGC TACTCGATTA TATTCTCTCA GCCGAAGGAC AAATGGCCTT CTCGCAAGGT
GGCTTAACCG CCTATCGCCC CGACATTGCC GCCTCTGCGC CACTGCATCT TTCGCAAGTA
AGCCAAGCGG TTGGGGGCGA GCAAAATATC ATCTTCTCAC GGCCCGATAA AGCCCTGGCT
GACCCTCAAC AACGCAGCCA GTTCCTCAAC CAATGGAAAC AAGCCCTCGG TCGCAACCAA
TAA
 
Protein sequence
MKSHFTRIVA HVFTVGLMAS LISCGSTQVT PTVSQSAQSV SASNADPALV EASKKEADGI 
LIYSIMSEKN WQPVIQGFNA KYPWIKVTTA DLGAYEVFER YYTEAAGNAR TADFISTTAP
DGWINFIDKG EVQSYVSSED AQIPELGKIA QGVYAASTDP MVFIWNKQLV ADPPQTMAEL
ISMIEKDPSA MQDKLVSYDA DGEGFGYGLN WFYTKAKGAD GWKTLEAIGS SQPKILTSGG
KMIDSVLSGE SSIGYFVSNI TVQPRLEAAQ QLLGYSFVPD AQVVAVRGMA VTKHAASPNS
AKLLLDYILS AEGQMAFSQG GLTAYRPDIA ASAPLHLSQV SQAVGGEQNI IFSRPDKALA
DPQQRSQFLN QWKQALGRNQ