Gene Haur_5102 details


Gene Information

Locus tag: Haur_5102
Symbol:
ID: 5737060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 132027
End bp: 133856
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 67%
IMG OID: 641282267
Product: hypothetical protein
Protein accession: YP_001547858
Protein GI: 159901612
COG category: [K] Transcription
COG ID: [COG2378] Predicted transcriptional regulator
TIGRFAM ID:
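
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(132027, 133856)
print(length)                           # 1830
print(expected_protein_length(length))  # 609
```

Both results match the Gene Length and Protein Length values listed in the record.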


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTATT CTCCCATGCC TGTCAAGATC CTCGCCGCCC ACGCGTTTGC TGCCGTGCCC 
ATTCGTACCT TGCGCACGCT GGCCCATTAT ACCCACTATC AACGGCTTGC ATCCCAAACC
CGTGCGGGCC TTGCCACTGA CCTGCTGTGC CACTGGACAA CACCGGCTTA TCGTCGGCTG
GTCCGGCAAT CCTTGACCGC TACCGATTAT GCCCTGCTCC ACGCCCTGTG GGCTGGCGAG
CATCCCCTCC CCGACCCTAA TACCCTTGAT CTCTGGCGCT GGCAGGCTCC GTGGCCAACG
CTCGCCAGCC TTTCCTCGCT CCAGCGTTTG GCGGTCTTGG GCTTCCTTCT GCCCATCCGC
ACGCCGCTAG GTCGGCAAAC CGTACTGCTC CGCGATACGA CCCGCTGGCT GCGTCGCGTT
CGTCCCGCGC CCGCACCGCC CACCCCCGCA TCGTTCCAAT CGCTCTTTCA GGCGGTCGCC
GAGCTGCTCA TCCACGGGTC GATTCAGCCA CTGCCTGCCG CGACACCCGG ATCGATGGCG
CGCACGCTTG CCCAGTCTGC TGGCTGGCTC GTCCTGCGGC TGGAGCAGTG GCGCACGACC
CCGCGTGGCA TCGCGTGGGT GCAGGCGAGC CTCGCCGAGC AGGAACGGCT GCTGCAACAC
CAGATCGTGC GCTGTTCCCC GCCGGACAGT GGTCTCCCCG CATGGCGCAA TCCCGATTGG
GCGACGCTCT GGCAGGCCTT CGAAGCCCTC ATGCACGATC ACGCCCCACG GCGGATGTGG
GATGTGCTGG CGCTGCTCTG GGCGCATCCA GCGTGGGGCA CGCTGCCGGA CGACCAGCGC
GGGCGGCTGT TTGGCCAGTG GCTGCGACAG GTACTCCAGC CAGCGGGGGT GGTCAGTCTC
GCGCAGGGCT GGGTGTTCTG GCATGGCTGG TCAGCGTTGA CGGTAACTGC GCCCCCTTTT
GATGGACTCC TGCTGCCCGC CACGCCTCAT CTGCCCCCGC TACTGCGGTG GTGGGCGACC
TACTGGGGCC AGCCGACGCA CCATGGCTGG CGGATCAGCG TGGCAGCGGT GACGGCACGG
GTGCAGCAGG ATGGCGATCT AATGGGGGTG TGGGAGCCAC TCGATGCGTG GTATGCCGCC
CGGCCACCAG CGGTGGAGTC CGTGGTGGCC ACGGTAGCGG CGCGACCACG CGTGCGGCTA
CGGCAGGTGA TGCTGGTGGA AGGCCGTGCT GAGGCGTTGA CGGTATTGGA GCAGCAGCGC
GGCATGCAGG GTGTGGTGCA GGCGGGCTGG GCGGCAACCC ACCGGGTGAT TGCGGCGGAA
GCAGTGGCCC AGGTCGCGCG TGCCGTGGGG TTGCCGTCGC CGCGCCAGTC CGCCCCGCCC
CGCGAGGTCG AAACGCTGGT GTTGGCGTTG CGGATTGCGG CGCAGCACGT ACCGAGCCAT
GCGACGGCGT TCCAAGGGCA AGCGCAGCAG TTGCTCGCCG ATCTGTCGTT TGCGCAGCGA
TGTGTGATCG ACGAGCAATG GGAGGGGTTG CACTCTAGCC TCACGCCGCC GCTGGCCATT
GATGCGGAAC CGCTGGCCGT TGGGCAGCAG CCACGAGCGC AGATCACGGT GGATCATGCA
CGACAGGTGG TGCGAGAGGC AATTCAGGCG GGTCATGCGC TGACGGTGCG CTATTACACG
CCGTCAGCGC ATCGGATCAC GACGCGCACA ATTCGCCCGC TCGAACTGAC CAGCACCGGA
GTCCGTGGCT GGTGCGAGCT GCGGCAGGAA GAGCGAGCTT TTCGCTTTGA TCGGGTGTTG
GCGGTGACAG TCCACCATGA ATCGGGGTAA
 
Protein sequence
MPYSPMPVKI LAAHAFAAVP IRTLRTLAHY THYQRLASQT RAGLATDLLC HWTTPAYRRL 
VRQSLTATDY ALLHALWAGE HPLPDPNTLD LWRWQAPWPT LASLSSLQRL AVLGFLLPIR
TPLGRQTVLL RDTTRWLRRV RPAPAPPTPA SFQSLFQAVA ELLIHGSIQP LPAATPGSMA
RTLAQSAGWL VLRLEQWRTT PRGIAWVQAS LAEQERLLQH QIVRCSPPDS GLPAWRNPDW
ATLWQAFEAL MHDHAPRRMW DVLALLWAHP AWGTLPDDQR GRLFGQWLRQ VLQPAGVVSL
AQGWVFWHGW SALTVTAPPF DGLLLPATPH LPPLLRWWAT YWGQPTHHGW RISVAAVTAR
VQQDGDLMGV WEPLDAWYAA RPPAVESVVA TVAARPRVRL RQVMLVEGRA EALTVLEQQR
GMQGVVQAGW AATHRVIAAE AVAQVARAVG LPSPRQSAPP REVETLVLAL RIAAQHVPSH
ATAFQGQAQQ LLADLSFAQR CVIDEQWEGL HSSLTPPLAI DAEPLAVGQQ PRAQITVDHA
RQVVREAIQA GHALTVRYYT PSAHRITTRT IRPLELTSTG VRGWCELRQE ERAFRFDRVL
AVTVHHESG
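
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a simple GC-fraction calculation of the kind that could be used to check the GC content field. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = "ATGCCCTATT CTCCCATGCC"
seq = clean_sequence(sample)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Passing the complete Gene sequence block through `clean_sequence` should give a string whose length equals the Gene Length field.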