Gene Haur_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1722 
Symbol 
ID5733609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2004947 
End bp2006086 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content50% 
IMG OID641278864 
Productextracellular solute-binding protein 
Protein accessionYP_001544493 
Protein GI159898246 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000849376 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACTTG TGCGTTCTTT GATGCTGTTG CTCTGTCTGA GCATTGTGAT TGCAGCCTGT 
GGTGGGGAAG CAACTCCAAC GGCTGTAACA GCACCAACCA CCGCTGCAGC AACCCCAACG
ACCGCTAGCG CAGCCGAAGC TACTGCTACC GTCGAAGTGA CTGCTATTGC TGAGGTTTCG
CCAACCGTGC CAACCACCAC CACCACTGAG CCAGCTGGCG GCAGTTTGGT TGTATATTCA
GGGCGTTCAG AAAGCTTAGT TGCTCCCTTG TTGGAGCTTT TTACCCAAGA AACTGGGATT
GCCGTCGAAG TACGCTATGG CGATACTGCC GAGATGGCCG CGCAAATTCT TGAAGAAGGC
GATAATAGCC CAGCCGATGT CTTCTTCGGC CAAGATGCTG GTGCATTAGG CGCATTGAGC
GATCGCTTTG TGCAACTTGA TCGGTCATTG TTGAACTTGG TTGATCCACG TTTTGTTTCA
AGCGAAGGCA CATGGGTTGG TGCTTCGGGT CGCGCTCGGG TTTTGGTTTA TAACACCGAT
GTCTTGACCA GCACCGATTT GCCAGCTTCG ATTCTTGATT TGACCAAACC AGAGTGGAAA
GGTCGGGTCG CTTGGGCTCC AACCAATGGT TCATTGCAAG CCTTTGTCAC CGCATTGCGG
GTAAGCCAAG GTGAAGATAT GGCCAAACAA TGGCTCGAAG GCATGATCGC CAACGACGTT
AAAGTCTATG AATCGAACTC AGCGATTGTC AAAGCCGTTG CGAGTGGCGA AGTTGATACC
GGCTTGGTCA ATCACTACTA CTTGTTTGCC TTGAAAAAAG AGCAACCAGA TGCCAAGGCT
GCTAACCACT ACTTCGGCAA TGGCGATATT GGCTCGTTGG TGAATGTTGC TGGCGTGGGT
GTGCTCAAAA CTGCCAAGAA TCCTGAAGCC GCTCAAACCT TTGCTAATTA TTTGTTGAGC
GACGGTGCCC AACTCTATTT CGCCGATAAA ACCAATGAAT ATCCCTTGGT AGCTGGAATT
ATTACCAACC CAGCAATCAA GCCATTGGGC GAAATTATTA CCCCAAACAT CGATTTGAGC
AATTTGAAAG ATTTGCAAGG AACCTTAGAA TTGTTGCAAG ATGTTGGGGC GATCCAATAA
 
Protein sequence
MKLVRSLMLL LCLSIVIAAC GGEATPTAVT APTTAAATPT TASAAEATAT VEVTAIAEVS 
PTVPTTTTTE PAGGSLVVYS GRSESLVAPL LELFTQETGI AVEVRYGDTA EMAAQILEEG
DNSPADVFFG QDAGALGALS DRFVQLDRSL LNLVDPRFVS SEGTWVGASG RARVLVYNTD
VLTSTDLPAS ILDLTKPEWK GRVAWAPTNG SLQAFVTALR VSQGEDMAKQ WLEGMIANDV
KVYESNSAIV KAVASGEVDT GLVNHYYLFA LKKEQPDAKA ANHYFGNGDI GSLVNVAGVG
VLKTAKNPEA AQTFANYLLS DGAQLYFADK TNEYPLVAGI ITNPAIKPLG EIITPNIDLS
NLKDLQGTLE LLQDVGAIQ