Gene Haur_2792 details

Gene Information

Locus tag: Haur_2792
Symbol:
ID: 5734673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3549470
End bp: 3550660
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 51%
IMG OID: 641279935
Product: hypothetical protein
Protein accession: YP_001545558
Protein GI: 159899311
COG category:
COG ID:
TIGRFAM ID:
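
The start and end coordinates and the two lengths in this record agree with each other if the coordinates are read as inclusive and 1-based, and if the final codon is a stop codon. A minimal Python sketch of that check follows; the function names are illustrative only (not part of any database API), and the inclusive coordinate convention is an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3549470, 3550660)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1191 396
```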


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGAAAGA AGCTTAACTA TTTATTCGGT TTAGGATTTG GCCTGGTACT TTTAATTGGC 
TGTGGCTCGG ATGCTGCGCT TGGTGAAGTG ACGCTCAGCG AGCCAATTAT CAACTTGAAT
ACTGGCCCGC GCACCACCAC CATCAGCTAT CTGATTGGGC AGCCAACCAA GGTTTCGATT
TGGCTTGAAA CCAGTAGTGG TGAGCGTTAT GCCTTGCGTC AAGCAGTTAC CCGTGAGCCA
TCCAAGGATG CGTATCAAGT GCTGTTTGAT GGCAGTGTGC CAGTTGATGC TGATACGAAT
CGTTTATTGC CGAGTGGTAG CTACACAGTG GTGATTGAGG CGGAAAATAC TGCGGCCCAA
CGGCTGAATT TGCAAATCGA TCAAGCGCCA AGTGATAGCT TTGACGTGTA TGATCTGCGG
GTTACGCCCA ATCCATTTTC GCCAAATGAT GATGCAGTTG AAGATTTTAC AACCTTCTCG
TATCGTTTGC CGATTACCGC GACCGTCTCG TTGGATGTGA TTGATTCAGC CAATGCGACG
CGCTATCCAA TTCTCAATCG TGAGTTACAA GGCTTGGGCG AACATAGTGA GGCTTGGTCG
GGGCGGCCTG TGGTTGGCGG TATTTTGGCC GCAGGAACCT ATCAATATGA ATTGCGAGCT
GATGATGGCC GTGGCAACCG CGTGACCAAG CGCGGCGATG TGACCCTCAG CAGTGCTGGG
ATCGGGGCGC TCGATGTGCT CAGCGTTGAA ATTGGCCCTG AACAGATCTT ATTAAACGAT
GTGATCACGG TCACGTTCAA GGTCAAAAAT AATAGCGATG TGGCCTTGCG CACCTTTGGG
CCAGCTTCGG GCTACACCTA TAGCACCAAC CAAAGCTACT CGTCGATTGA AAACGAGCAG
TATGCTAACC TTGGCGGCGG CTTATGGCGG GTGGCAATCG ATTGGGATGG CAATGGTGGC
TCGGGTTTTC GCTACCCATT CCGTTGGGCG ATTTCGCCAC GCAGCCCTGA CCAATGGTAC
GACCCCAACA AATTTGATTA CCTGTATCCA GGCGAAGAAG CCACGATTAT TGGGCGCGTG
CAAATCAAAC AACGCGAAGA TCGCATGACC TTTTATGCTG GCGTGGCCCA TGAAGGTGTT
GATTACCCCA CCAATCGACT CAAACCAACC CTAATTCAAG TTTCATTCTA A
 
Protein sequence
MRKKLNYLFG LGFGLVLLIG CGSDAALGEV TLSEPIINLN TGPRTTTISY LIGQPTKVSI 
WLETSSGERY ALRQAVTREP SKDAYQVLFD GSVPVDADTN RLLPSGSYTV VIEAENTAAQ
RLNLQIDQAP SDSFDVYDLR VTPNPFSPND DAVEDFTTFS YRLPITATVS LDVIDSANAT
RYPILNRELQ GLGEHSEAWS GRPVVGGILA AGTYQYELRA DDGRGNRVTK RGDVTLSSAG
IGALDVLSVE IGPEQILLND VITVTFKVKN NSDVALRTFG PASGYTYSTN QSYSSIENEQ
YANLGGGLWR VAIDWDGNGG SGFRYPFRWA ISPRSPDQWY DPNKFDYLYP GEEATIIGRV
QIKQREDRMT FYAGVAHEGV DYPTNRLKPT LIQVSF
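
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, in which the codon-to-residue assignments match the standard code and an alternative initiation codon such as TTG is translated as M. The sketch below is one way this could be reproduced in Python; `translate_cds` and `START_CODONS` are hypothetical names, and the start-codon set is a deliberately small subset assumed for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the initiation codons allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "TTGCGAAAGA AGCTTAACTA TTTATTCGGT TTAGGATTTG GCCTGGTACT TTTAATTGGC"
print(translate_cds(first_line))  # MRKKLNYLFGLGFGLVLLIG
```

The printed residues match the first 20 residues of the protein sequence listed above.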