Gene Haur_3046 details


Gene Information

Locus tag: Haur_3046
Symbol:
ID: 5734918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3847245
End bp: 3848255
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 50%
IMG OID: 641280190
Product: periplasmic solute binding protein
Protein accession: YP_001545812
Protein GI: 159899565
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
TIGRFAM ID:
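
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    return gene_len_bp // 3 - 1


length = gene_length(3847245, 3848255)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Under those assumptions the values agree with the listed Gene Length (1011 bp) and Protein Length (336 aa).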


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0205239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCGAT ATAGTTTAGG GTTGTTGCTG GTATTGGTGG TTGGTTGTGG CCAAAGTACA 
GCCACAGTTC AGCCAAGCCA AGTGAGCCAA AGCCAACAAC AAACCACCAC CGCTGATCAA
ATCATGCCAA CGGCAACCGC AATTAGCACC GCGCCAGTCA GCCAAATTCG CGTCGTTACA
ACCATGAGCA TTTTGGCCGA TGTGATTAAG CAGGTTGGTG GCGAACGGGT GCTTGTTGAT
AATATTATTC CCTTGGGCGC TGGCCCCGAA GATTATCAAG CTACGCCTGG CGATAGCCAA
AAAATTGCCG ATGCCAATAT TGTGTTTTTC AATGGCCATG CGCTTGAGGA ATGGCTCGAA
CCCTTGTTCG AAAATGCTGG GGGCAGCGAG CAGCCAAGGA TTGAATTATC TGCTGGTTTT
GCAGTGATTG AAGAAGAACA TGCTGAAGAA GAACACGCTG ATGAAGAACA TGCTGATGAG
CATGCTCACG AAGAAGGCAA CCCGCACTTT TGGCTTGACC CAACCTATGT GATGTCGTAT
ACCCTGACGA TTCGCGACCA ACTTAGTGCG ATCGATCCCA GTGGCAAGGA TGTCTATGCA
GCCAATGCCG AAGCCTATCT TGGCCAATTA CAAGCGCTCG ATCAAGAATT GCAAGGCTTG
GCGGCCCAAA TTCCGGCTGA ACGGCGCAAA CTCGTGACCA ACCACGATGC CTTTCCGTAT
TTTGCCCACC ACTATGGCTT TGAAGTTGCT GGCGTGTTGT TGGATAACCC CGAAGCCGAG
CTTTCGGCTG GCGATTTAGC GGCTTTGGTC GAGAGCGTTA AGGCCAGCGG CGTGCCGGCA
ATTTTCTCTG AATCGCAGTT CAACCAAAAA ACTGCCCAAT TGCTGGCGGA TGAAGCTGGG
ATTGAAACCA TTGCGGTGTT GTATACCGAC ACTTTAGGCA GCGATACTGC AACTTCCTAT
ATCGACATGA TGCGGTACAA TATGAATACT ATTGTTGCTG CGCTCAAATA A
 
Protein sequence
MRRYSLGLLL VLVVGCGQST ATVQPSQVSQ SQQQTTTADQ IMPTATAIST APVSQIRVVT 
TMSILADVIK QVGGERVLVD NIIPLGAGPE DYQATPGDSQ KIADANIVFF NGHALEEWLE
PLFENAGGSE QPRIELSAGF AVIEEEHAEE EHADEEHADE HAHEEGNPHF WLDPTYVMSY
TLTIRDQLSA IDPSGKDVYA ANAEAYLGQL QALDQELQGL AAQIPAERRK LVTNHDAFPY
FAHHYGFEVA GVLLDNPEAE LSAGDLAALV ESVKASGVPA IFSESQFNQK TAQLLADEAG
IETIAVLYTD TLGSDTATSY IDMMRYNMNT IVAALK
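
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that blocking, translate the coding sequence, and compute a GC fraction. It is a minimal illustration, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, which NCBI translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted start codons, which this sketch does not handle).

```python
# Standard codon assignments, bases ordered T, C, A, G.
# Assumption: alternative start codons (e.g. GTG, TTG) are not special-cased.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, dropping a terminal stop codon if present."""
    dna = clean(dna)
    protein = "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGCGTCGAT ATAGTTTAGG GTTGTTGCTG"))  # MRRYSLGLLL
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.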