Gene Haur_2251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2251 
Symbol 
ID5734138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2869224 
End bp2871887 
Gene Length2664 bp 
Protein Length887 aa 
Translation table11 
GC content54% 
IMG OID641279392 
Producthypothetical protein 
Protein accessionYP_001545019 
Protein GI159898772 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00410803 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTGA AGCCTGTTCA GCTTCTTATT ATGACGTTCC TGCTTTTGAC CATGGCTCAG 
ACTGGTGCGA GATTTCGTGC CTACGCGCGG TTTGACGCTC CAGTTCAATC AATCCTTGCT
GGTGAATCGC GTGGTAGCAC CCAAATTGGG GCAGCTGGCA TCCAACGCAC CAGCGCTGCA
ATTATGGCCA GCCAAGCCTT AGCCGATCAA CAACCGCAGC CAGTACGTAT AGCAAAACCA
CGCTTTCAGC TTGATCGCCA GCGTTTAGCT GAAAATCCCA ATGCTCCAGC GGTTACCCAA
TGGCCGACTC AAGCAGATAC TTCCCCAGTT GGAGCATCGA ATGCCACGGC CTTGAGCAGT
TTGAGCACTA CTTTTACCGG TGCAACCTTG GCTGATACTA ATCGAATTCC GCCGGGCACC
ATGGGCACGG TTGGGCCAGG CCAATTTGTG GTGGCAATTA ATGGCCGCTT GCGAACCTTC
AATAAAGCGA CGGGTGTTGC TGATGGCGTG ATCGATAGTA CGCTGGAAAC GTTTTTTAGT
TCGGTGATGA CCCCACCGAT TGCCAATAAC ATCACCAACG ATCCACGGAT TCGCTATGAT
CGAATGACCC AGCGCTGGTT TCTGACGGTC ACCGATCTGC CAGGCCTTTC GGGCAATCAA
GTCAATCGGC TGTTGCTGGC GGTTAGCGAT GCTGCCAGTG CGGGAGTTAT TACGCCTAGC
ACGGTCTGGA CATTCTACTT TTTTCAAGGC AGCAACACCG ATGTGATGGA TTATGCGAGC
TTAGGCGTTG ATGTTAATGC CTTGTATATT GGCGCTAACA TGTTCACCAC CGCTGGTGCC
TTTGTTGGCA CCAATGGTTA TGTGGTGCAA AAAAGCTCGA TCCTTGGGGC TGGGCCAATG
GTGGTAACCA CCTTTGCTGG GTTAGTGGCA GGTGGGACTG GCGCTGGCCC ATTTGCCCCA
CAAGGGGTTG ATAATTTTGA CCCAACTGCA ACCGCAGGCT ACTTTGTGGG GGTTGATAAT
GCCACATTTA GCACAATCAT GTTTCGGAGG GTGAGCAATC CAGGCAGTAT GACCCCCACG
ATTTCGGCCA ATATCGCGGT GACCGTACCC ACCACGACCT TTCCTACCCG CGTTCCCCAC
CTAGGCAATA CTGGGGGGGC GAACGGTCAA CTCGATGGCA TTGATGATCG CTTGTATTCA
GCCATGATTC GTAATGGTCG GCTGTGGACG GCGCACAGCT TCAGAACCAA TGCCGCTGGG
GTTGCAAGCA CCGCAACCGG GGCACGTAAC TCTGTGCGTT GGTACGAGTT TCAGAATCTT
GATACGACAC CAACCTTGCG CCAAGCGGGA ACGGTGTTTG ATAATGCTGC TGCCAACCCG
CTCTTTCAAT GGATTCCAAG CGTTGCTGTT TCAGGCCAAG GCCACGCTGT GATGGGCTTC
AGTAGTGCCG GAGCCACAGC CCGCGCCAAT GCATCCATGA CCAGCCGCTT GGCTGGCGAT
ACTCTTGGAA CAATGCAAGC GCCGACGCTC TATACGGCGA GCAGCTTTGA TTACAATCCC
GCCGCTGATC CTGGTGGGGC GGCTGGGCGA CTTTGGGGCA CTACCTCCTA TACGAGCCTC
GACCCCAGCG ATGATATGAC CATGTGGACG ATTCAGCAGT TTACCAATGC CACCGATTCG
TATGGGGTGC AGGTTGTCAA AGTACTAGCT CCGCCGCCAG CAACCCCTAC TACCAGTAAT
CCAGCCTCGG TCGATCAGGG AACGACGACG GATATCATCA TTACTGGTAC TTCGAGCGCT
GGCTCGGGCT TTTATGATCC TGGGGCAGGG TTTAGCAACC GAATTACTGC CAGCATCAAT
GGTGGCGGGG TAACAGTAAA TAGTGTTACC TACAACAGCC CAACCCAAAT TACCCTCAAC
ATCACGGTTG CACCTGGAGC TAGTGCTGGT GCACGCATTG TGACGGTAAC TAACCCAGAT
GGTCAGAGCC TTGACAGCAC CAGCGGGATT GTAACGATCG TTGCAGCCGC GACGGCCACT
CCAACCAACA CACCGACGAA TACACCAACC AACACGCCAA CTAATACCGC GACGAATACA
CCAACCAACA CGCCGACCAA TACACCAACC AATACCGCGA CAGCAACGGT TAGCAACACA
CCAACCAATA CGCCAACCAA TACCGCGACA GCAACGGTTA GTAACACACC AACGAATACG
CCGACTGGTA CACCAACCAA CACGCCTGTG CCCACAACAT TTATTCGATA TTTGCCCTTT
GTAACCATGA GCCGACTTGG CTCAATTGCG ACCCTCGGAT CAGCTGCGAT ACCGACTAAC
CCAATCGCAA CCCCAGGCCT CGTCTTTTTT ACCGGTACAA TCAGTCTACC GACAGTGCTA
CCAAGTGGTG GAACGTATTG GCTTTCATCA AGCCCGAGCA GCCTTGTAGC AGGTTTAGTT
GATGATGCGG TGCTTATACG GGCGGGGCCA ACCGAGCTAT TTCGTTACGA ATATGGAAGC
AATGGACAGC CGCAGGCCGC GTTGGTCGAA GTGCCAGCAA ACATCTTGAT TCCAAGGGCT
GGACAGACGC TAACGGTGGA GTTTGTGGAT CTCTATGGGA GTGTTTATAG CGCGACCCCC
TTATACCTCG TTTGGACACC CTAA
 
Protein sequence
MRLKPVQLLI MTFLLLTMAQ TGARFRAYAR FDAPVQSILA GESRGSTQIG AAGIQRTSAA 
IMASQALADQ QPQPVRIAKP RFQLDRQRLA ENPNAPAVTQ WPTQADTSPV GASNATALSS
LSTTFTGATL ADTNRIPPGT MGTVGPGQFV VAINGRLRTF NKATGVADGV IDSTLETFFS
SVMTPPIANN ITNDPRIRYD RMTQRWFLTV TDLPGLSGNQ VNRLLLAVSD AASAGVITPS
TVWTFYFFQG SNTDVMDYAS LGVDVNALYI GANMFTTAGA FVGTNGYVVQ KSSILGAGPM
VVTTFAGLVA GGTGAGPFAP QGVDNFDPTA TAGYFVGVDN ATFSTIMFRR VSNPGSMTPT
ISANIAVTVP TTTFPTRVPH LGNTGGANGQ LDGIDDRLYS AMIRNGRLWT AHSFRTNAAG
VASTATGARN SVRWYEFQNL DTTPTLRQAG TVFDNAAANP LFQWIPSVAV SGQGHAVMGF
SSAGATARAN ASMTSRLAGD TLGTMQAPTL YTASSFDYNP AADPGGAAGR LWGTTSYTSL
DPSDDMTMWT IQQFTNATDS YGVQVVKVLA PPPATPTTSN PASVDQGTTT DIIITGTSSA
GSGFYDPGAG FSNRITASIN GGGVTVNSVT YNSPTQITLN ITVAPGASAG ARIVTVTNPD
GQSLDSTSGI VTIVAAATAT PTNTPTNTPT NTPTNTATNT PTNTPTNTPT NTATATVSNT
PTNTPTNTAT ATVSNTPTNT PTGTPTNTPV PTTFIRYLPF VTMSRLGSIA TLGSAAIPTN
PIATPGLVFF TGTISLPTVL PSGGTYWLSS SPSSLVAGLV DDAVLIRAGP TELFRYEYGS
NGQPQAALVE VPANILIPRA GQTLTVEFVD LYGSVYSATP LYLVWTP