Gene Haur_2856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2856 
Symbol 
ID5736893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3625402 
End bp3627084 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content45% 
IMG OID641279999 
Producthypothetical protein 
Protein accessionYP_001545622 
Protein GI159899375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00133424 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCAA CCAACTGGCA GCGCGAGGTT GAACAAGCCT TAGCTCAAAA CGACCACAAT 
CGAGCCAAAC AGCTTTTAGC CGCCGTGATT CGCCAAAATG TGCATGAACG CGAGGCCTGG
CGCATTTTAG CGAGCATCGT CAATGATCCA GCCCAACAGG CCGAGTGTTT GCGCCGAATT
GCCGCAATTG ATGCCGCCAC GCCAACTGTG ATTAAGCCAA TTAAGCCAAT TGCTGCCGTA
GCACCAAGCA CTGAGCTACC AAGCGATCCA AGCACTAGCC CAACCACGCC CTTGGTTGCT
GTACAGGATA CGCCTAATTT TACGCCATCA AGCATTCAAA CTGAGCCATT ATTTGATTCA
CAAGCTAGTT TCACGCTGCC AGCCAATTAC AGTATGGGCC AAGCAACCCA GAAACTGGAT
CAACCAATGC CGCAACCTGC TCGTTCGTTG CCTAAAATGT TAATGTTGAT TGGAGCAAGT
TTGTGTGGTT TGGGCTTGTT GCTTTTGCTT GGTTTACAAT TTTGGCCATT GTTAAATGAT
ATCAACTCAA CGCCCGCTAG CGAACAACCG CTGCTAGCCG TGGTTGAACT CACGATTGTA
CGATCGGGCG TTGGTCAAGT GCCGATGACC GATCAGGCCA TGGTCTATTT TGAAATTGAA
AATCCCAGTG ATCAAGCACT GTATAACATT CCCTATACCA TGACCTTGAC CAGCCAATTT
GATAAAGTGC TGAAAAATAC CAATACGATT GAATTATTGC TGCCCAAGCA AAAAATCGTG
GTTGTTGATG GGATTTTTAC TGGGGATAGC TTTCGCGCCA AAAGCGTCGA AATTAGCCTA
GGCCAGGGCT ATCCAATCAC AATTGATCAA CCAATTCCAC AAGTTGAGGC GCTTACTCCA
ACCACGTTTG GCATGCCCTT TCGTGGGCGT TTGCAGGAAA GTTGGGGCTA CGAATACCCA
ACTTATCATG TTAGCGCGAT TGCAACCGAT AGTGGTGTGG TTACTTCAAC CTTGGTATCA
ATGCTGTTTT ACGATACGAA CGATCAAATT ATTGGCGCTG GTTCGGGCTA TATCGATTAC
ATTAATAAAA AAACTTCATT GCCAAATACT CGCCGTGGTG AAACGATGGT CTATCCCATT
ATTTGGGGCA CGAGTGAGGT CGATCGGGTT GAAATTGTGC CTGCATTTAC TTTAGCAACA
TTTAGTCAAC AACCCCCTAT TAAGCCTAAA AAGGCTGAGA TCGATCAAAT GTGGCTAGGT
GAATATATTC CAGAGGTTGA TCCACCATAT CATGTACAAG GTATTGTTGA ATATCCAACA
TTACCGCCAA CCTCAGGCTA TCATGCGATG GTGCCAGCAG AATGGGGGCA TTACGATAAT
TTTATTACTG ATGAGGCAAT GGTACATAAC TTAGAGCATG GTGCGGTGAT TATTTTTTAT
AACCCACAAT TAATTACTGC AAATGAGCTT GGCCAATTAG AAGCAACTTT TGAAAATCTT
TATCAACGCG AACATCATAC AATTTTGCAA AAACGCTTCG ATTTGGATGC AACTGTGGCA
ATGACTGCGT GGGAATATCG TTTGATGCTG CGCGATAATG TTAATCTCGA TGCAGTCAAC
ACGTTTTTCA GTGAACATAT TGCCCGTGGC CCTGAATGCG TTAATTTACG CTGTCCTAAT
TAA
 
Protein sequence
MEPTNWQREV EQALAQNDHN RAKQLLAAVI RQNVHEREAW RILASIVNDP AQQAECLRRI 
AAIDAATPTV IKPIKPIAAV APSTELPSDP STSPTTPLVA VQDTPNFTPS SIQTEPLFDS
QASFTLPANY SMGQATQKLD QPMPQPARSL PKMLMLIGAS LCGLGLLLLL GLQFWPLLND
INSTPASEQP LLAVVELTIV RSGVGQVPMT DQAMVYFEIE NPSDQALYNI PYTMTLTSQF
DKVLKNTNTI ELLLPKQKIV VVDGIFTGDS FRAKSVEISL GQGYPITIDQ PIPQVEALTP
TTFGMPFRGR LQESWGYEYP TYHVSAIATD SGVVTSTLVS MLFYDTNDQI IGAGSGYIDY
INKKTSLPNT RRGETMVYPI IWGTSEVDRV EIVPAFTLAT FSQQPPIKPK KAEIDQMWLG
EYIPEVDPPY HVQGIVEYPT LPPTSGYHAM VPAEWGHYDN FITDEAMVHN LEHGAVIIFY
NPQLITANEL GQLEATFENL YQREHHTILQ KRFDLDATVA MTAWEYRLML RDNVNLDAVN
TFFSEHIARG PECVNLRCPN