Gene Haur_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2471 
Symbol 
ID5734352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3161287 
End bp3162459 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content51% 
IMG OID641279611 
Productextracellular solute-binding protein 
Protein accessionYP_001545237 
Protein GI159898990 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAAT CGAAGCTTGG TGCTTTTCGG CTACGTGCTG CTAAATTAGC GGCCTTATTC 
CTCGTCGTTT CGTTGATGAT TAGTGCGTGT GGCTCAGCTG CCAGCGAAAG CACCACCGCC
CCAACCGCTG GCTCAGGTGG CACATCATCA ACTATGGATG CTTTGATTGC TGCTGCCAAA
GCCGAAGGCG AATTAACCGT GATCGCCTTG CCCCACAATT GGCTCAACTA CGGCGAAATG
ATCGAAAACT TCTCGAAAAA ATATGGCATC AAAATCAACG AACTGAATCC TGATGCTGGT
TCGGGCGACG AAATCGAAGC GATCAAGGCC AATAAAGATA ACAAAGGCCC ACAAGCCCCC
GATGTGATCG ACGTTGGCTT TGCTTTTGGT CCATCCGCCA AGCAAGAAAA TCTCTTGCAG
CCCTATAAAG TTTCAACCTG GGATACGATT CCTAACGAGT TGAAAGATCC TGAGGGCTAT
TGGTTTGGCG ATTACTATGG CGTGTTGGCG TTTGCCGTCA ACAAAGATGT GGTCAAAAAT
GTGCCGCAGG ATTGGGCCGA CCTCTTGAAG CCTGAATACA AAGGCCAAGT CGCCTTGGCA
GGCGATCCAC GGGTCTCAAA CTTGGCGATT CAGTCAGTCT ACGCCGCAGC ACTTGCTAAT
GGTGGTAGCT TGGATAACGT TCAACCTGGC TTGGATTTCT TCAGCAAATT GAACCAAGCT
GGCAACTTTG TGCCAGTTAT CGCCAAACAA GGTACCTTGG CCCAGGGCGA AACCCCAATT
ATGATCACCT GGGACTATTT GGCAATCAGC GCTCGCGATG AATTGGGCGG TAACCCTGAA
ATCGAAGTGG TTGTGCCAAA ATCAGGCGTG CTTGGTGGCG TGTATGTTCA AGCGATCAGC
GCTTATGCTC CGCACCCAAA TGCTGCCAAA TTGTGGATGG AATACCTCTA CTCCGACGAA
GGCCAATTGA CTTGGCTCAA AGGCGGCGGC CACCCAGTGC GCTACAACGA TTTGGTTGCT
CGCAATGTAA TTCCAGCCGA AATTGCGGAA AAATTGCCAC CAGCCGAACT CTACGCCAAC
GCCGTATTCC CAACCTTGGC GCAGCTCGAA GCTGCCAAAA AAGTTATCGT CGATGGTTGG
GATACCACGG TCAACGTCGA TGTCAAAGAA TAA
 
Protein sequence
MAQSKLGAFR LRAAKLAALF LVVSLMISAC GSAASESTTA PTAGSGGTSS TMDALIAAAK 
AEGELTVIAL PHNWLNYGEM IENFSKKYGI KINELNPDAG SGDEIEAIKA NKDNKGPQAP
DVIDVGFAFG PSAKQENLLQ PYKVSTWDTI PNELKDPEGY WFGDYYGVLA FAVNKDVVKN
VPQDWADLLK PEYKGQVALA GDPRVSNLAI QSVYAAALAN GGSLDNVQPG LDFFSKLNQA
GNFVPVIAKQ GTLAQGETPI MITWDYLAIS ARDELGGNPE IEVVVPKSGV LGGVYVQAIS
AYAPHPNAAK LWMEYLYSDE GQLTWLKGGG HPVRYNDLVA RNVIPAEIAE KLPPAELYAN
AVFPTLAQLE AAKKVIVDGW DTTVNVDVKE