Gene Haur_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3997 
Symbol 
ID5735858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5102626 
End bp5104581 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content53% 
IMG OID641281147 
Productphosphate binding protein 
Protein accessionYP_001546757 
Protein GI159900510 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGAAAC TGCTGAGTTT AATGCTTGCC TCAATTATGT TGCTGACAAT GCTCGCAGCA 
TGTGGCGGCG ACAGCACTCC AACCACTGCA CCAACTACGG CACCAGCTAC CGCCACTACT
GGCCAAGCTG CTGCAACAAC TGAACCAACC GCTCCTGCTG CTGAAACTCC TACAGTTGAA
GCAACCGCTG AAGTTCCTGC TGGTGGAGAA GTTGACCCAG CAATGGTAAA AGGCGATATC
GTGAGCGCTG GTTCATCAAC GGTCTATCCG TTGAGCGAAG CCGTGGCCGA AATCTTTACC
GAAGATGGCT ACACTGGCAA TATCACGATC GATAGCATCG GCACGGGCGC TGGGTTCGAG
CGCTTCTGTA CCGCTGCCGA AACCGACATC GCCAACGCCA GCCGCGCAAT CAAAGACGAA
GAAGCCAAAG CCTGTGCCGA TAAAGGCCGC GAAGTCGTCG AGTTCCGCGT CGGTACCGAT
GCCTTGGCCG TCGTGGTCAG CAGCAAAAAT ACCTTCGTCA GCAACTTGAC CGAAGCCCAA
GTCGCCGACA TCTTCTCAGG CACCTACAAA ACCTGGGATC AAGTTGATGC CAGCTACCCA
GCCGAAGCGA TCAAACTCTA CAGCCCAGGC ACCGATAGCG GTACCTTCGA CTACTTCGTC
GAACATTTCT ACGCGAAAGA AGGTAAGTTC ATCTTGGGTG CAAACCCACA GTTGAGCGAA
GACGATAACG TGTTGGTGAC CGGGATCGAA GGCGATGCCA ATGCGATCGG CTACTTTGGC
TATGCCTACT ACAACGAAAA CAAAGCTAAG CTCAAAGCCT TGACGATTGA CGGTGTGGAA
CCAACCGAAG CCACCACCGA AGATGGCAGC TATCCGTTGG CTCGTCCGTT GTACATCTAC
TCAGCCAAGA ATATTTTGAC TGAAAAGCCT CAAGTCGCGG CTTTCATCAA CTACTACTTG
ACCAACGTCA ACGATGTTAT TCTTGAAGTA GGCTACTTCC CAGCCAGCGA CGAAGCGTTG
GGCGAAGCTA AAGACGCTTT GGTTAATGCC TTGACGGGTG GCAGCAGCAG CAATACCAAC
ACTGGTAGCG CCGTTGCCCT CGAAGAAGTT GATCCAGCAG CAGTTCAAGG TGATATCGTG
AGCGCTGGTT CATCAACGGT CTATCCGTTG AGCGAAGCCG TGGCCGAAAT CTTCGGCGAA
GATGGCTACA GTGGCAATAT CACGATCGAT AGCATTGGCA CGGGCGCTGG GTTCGAGCGC
TTCTGTACCG CTGCCGAAAC CGACATCGCC AACGCCAGCC GCGCAATCAA AGACGAAGAA
GCCAAAGCCT GTGCCGATAA AGGCCGCGAA GTCGTCGAGT TCCGCGTCGG TACCGATGCT
TTGGCCGTCG TGGTCAGCAG CAAAAATACC TTCGTCACCA ATTTGACCGA AGCTCAAGTC
GCTGACATCT TCTCAGGCAC CTACAAAACC TGGGACCAAG TTGATGCCAG CTACCCAGCC
GAAGCGATCA AACTCTACAG CCCAGGCACC GATAGCGGTA CTTTCGACTA CTTCGTTGAA
CATTTCTACG CGAAAGAAGA AAAATTCATG TTGGGTGCAA ACCCACAGTT GAGCGAAGAC
GATAACGTGT TGGTGACCGG GATCGAAGGC GATGCCAATG CGATCGGCTA CTTTGGCTAT
GCCTACTACA ACGAAAACAA GAGCAAACTC AAAGCGTTGA CGATTGATGG TGTGGAACCA
ACCGAAGCCA CCACCGAAGA TGGCAGCTAT CCGTTGGCTC GTCCGTTGTA CATCTACTCG
GCTAAGAACA TCTTGGCTGA AAAAGCCCAA GTCGCGGCCT TTATCAACTA CTACTTGACC
AACGTCAATG AAGTTATCCT CGAAGTGGGC TACTTCCCAG CGAGTGAAGA AGCATTGAAC
GAAGCCAAGC AAAACCTGCT CGACGCAACC AAGTAA
 
Protein sequence
MRKLLSLMLA SIMLLTMLAA CGGDSTPTTA PTTAPATATT GQAAATTEPT APAAETPTVE 
ATAEVPAGGE VDPAMVKGDI VSAGSSTVYP LSEAVAEIFT EDGYTGNITI DSIGTGAGFE
RFCTAAETDI ANASRAIKDE EAKACADKGR EVVEFRVGTD ALAVVVSSKN TFVSNLTEAQ
VADIFSGTYK TWDQVDASYP AEAIKLYSPG TDSGTFDYFV EHFYAKEGKF ILGANPQLSE
DDNVLVTGIE GDANAIGYFG YAYYNENKAK LKALTIDGVE PTEATTEDGS YPLARPLYIY
SAKNILTEKP QVAAFINYYL TNVNDVILEV GYFPASDEAL GEAKDALVNA LTGGSSSNTN
TGSAVALEEV DPAAVQGDIV SAGSSTVYPL SEAVAEIFGE DGYSGNITID SIGTGAGFER
FCTAAETDIA NASRAIKDEE AKACADKGRE VVEFRVGTDA LAVVVSSKNT FVTNLTEAQV
ADIFSGTYKT WDQVDASYPA EAIKLYSPGT DSGTFDYFVE HFYAKEEKFM LGANPQLSED
DNVLVTGIEG DANAIGYFGY AYYNENKSKL KALTIDGVEP TEATTEDGSY PLARPLYIYS
AKNILAEKAQ VAAFINYYLT NVNEVILEVG YFPASEEALN EAKQNLLDAT K