Gene Haur_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1192 
Symbol 
ID5733085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1371697 
End bp1373736 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content50% 
IMG OID641278332 
Producthypothetical protein 
Protein accessionYP_001543968 
Protein GI159897721 
COG category[S] Function unknown 
COG ID[COG1306] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000450505 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTTG TGAGTCAAAC CAAGCGTCGC CTAACAGCTG GTCTGTTTGG CGCTGTTGCG 
TTGGTGCTTG CCGCCTGTGG CAGTAGCAAT CCGCCACTGA CGGGAATTGT AACAGATAGC
TATACCCAAA AGCCCGTTGC TGGAGTAACC ATTCAAGTTG GCGAAGCAAG CGCTACGAGT
GATGCTGATG GTAAATGGAC GATTAATGAA TGGGAAAATA CCAATTCACT CTTAGTTCAA
GCCAGCGATT ATAGCTCAGC TACGCTTAGT TTGGCCGATA AAACTCCTGT CGATGAGCAA
ACTCCGGTAG AAGTTAATCT GACGATTCGT CCCAACACGA TCAGCGGGGT TGTGCTTGAT
CAATACTCAC AACAGCCTGT GGCTGGCGTG ACGGTCAAGG CTGGCACGAG CCAAGCCACC
AGCGGCGCTG ATGGCCGCTA CAAATTGACC GATGTTGCTG AAAAGGCTGA AGTGGTGATT
GTGGCAACCG ATTACACCAG TGCCACCGCT ACCCTCGAAA AACAAACAAG CTACGATGTT
TCGCTGCGCC CAACCAGCTT AACTGGGATC ATCAGCGATA AATATAGCCA AAAACCAGTC
AGCGGAGCCA CGGTCAGCGT TGGTAGCGCC ACGGCCCAAA GCGATGCTGA AGGTCGCTAT
ACAGTTAAGA ACATCGATTT GGATGCACCA GTTGTTTTCA GCGCTACCGA TTACAGCAGC
CAAACCTTAG AATTGCCGCA AGCCGCCTCG TTGGATGTGG TTTTGCGACC TTCGACCGTG
CGTGGCTCAG TCGTCGATAG TACAACAGGC AAGCCGTTGA CCAAGGGCAC GGTTATCGCC
ATGGTCAAGC CATTTGAAGG GGCTGATGAA ACCTATCCCT ACACTGGCAC TGCCGTTACT
ATGGCACGGC TGAATGCCGA TGGCACCTAT GAATTAACCG ATGTGCCTGA AAATGCCCAA
ATTCAGGTGC TCTCGCCTGG GTATCGCAAG GCTTGGACGG CGCTCAGCGA AGGCAAATTT
ACCGCCGATC TAGAAGCTGA AGAATTTGTA GCCAAAGCAA TTTATATTAC CGCAGCAACT
GGCTCGTCAA AAGCTTCATT AAGCGAATTG TTTGATTTGG TTGATCAAAC CGAAGTTAAT
GCGGTGGTGA TCGATATCAA GCTGGATATT GCTGGCGATG TTGGCGGAGT AGGCTATCTC
TCGCAACATC CATTGGTATT GGCCGCCGAA ACCTCATCCG ATTATTTGGA TATGGAATGG
ATTGTGGCCG AAGCTCGCAA GCGCGATATC TACTTAATTG GCCGCATGGC GGTAATGCGC
GATAATCGTT TGGCCGATGC TCACCCCGAA TGGGCCGCCC AAAGCAAGGC CACTGGCGGA
GTTTGGGAAG ATGACGGTGG TCTCAAGTGG CTTGATCCAT TCAACCCCAA CGTCACCGAG
TATAATGTGG GCATTGCCAA AGAAATTGCC GCATTTGGCT TTGATGAAGT ACAATTCGAT
TACATTCGCT TCCCATCGGA TGGCAGCACC AGCAATTTGG TTTTCTCCAA GCCGATTGAT
CCCAAAAATA ATCCGGAAGT GATGTACGAA GCAATTGGCA ATGTGCTCAA ACGCGCTCAT
GGCGATATCA ATGGTTCAGG CGCATTCTTC TCAATCGACG TGTTCGGTTA TGCCACATGG
CGTAATATGT GGGAAATTGG CCAAAGCCTT GAAATTATGG CCGATCACAC CGATTATGTC
TGTGCAATGG TCTATCCTTC GCACTACGAT CGCAATGAGT TGGGCTTCGA TAACGCCGAT
GCCTACCCTT ATGAGATCGT CAAGGATAGT ATCGAAAAAG GCCAAAAGCG CATGGAAGGC
AAATACGCAG TGCAACGACC GTGGCTTCAA GCCTTCACCG CGACATGGCT TGATCCAGTA
ACACCATATG GTCGCACCGA AGTTCGCGCC CAAATGCAAG CAGTCGCCGA AGTCGAAGGC
ACGTATGGCT GGATTCTCTG GAATGCTGCC AATTATTACG ACCCCGACTG GCTCGATTAA
 
Protein sequence
MYFVSQTKRR LTAGLFGAVA LVLAACGSSN PPLTGIVTDS YTQKPVAGVT IQVGEASATS 
DADGKWTINE WENTNSLLVQ ASDYSSATLS LADKTPVDEQ TPVEVNLTIR PNTISGVVLD
QYSQQPVAGV TVKAGTSQAT SGADGRYKLT DVAEKAEVVI VATDYTSATA TLEKQTSYDV
SLRPTSLTGI ISDKYSQKPV SGATVSVGSA TAQSDAEGRY TVKNIDLDAP VVFSATDYSS
QTLELPQAAS LDVVLRPSTV RGSVVDSTTG KPLTKGTVIA MVKPFEGADE TYPYTGTAVT
MARLNADGTY ELTDVPENAQ IQVLSPGYRK AWTALSEGKF TADLEAEEFV AKAIYITAAT
GSSKASLSEL FDLVDQTEVN AVVIDIKLDI AGDVGGVGYL SQHPLVLAAE TSSDYLDMEW
IVAEARKRDI YLIGRMAVMR DNRLADAHPE WAAQSKATGG VWEDDGGLKW LDPFNPNVTE
YNVGIAKEIA AFGFDEVQFD YIRFPSDGST SNLVFSKPID PKNNPEVMYE AIGNVLKRAH
GDINGSGAFF SIDVFGYATW RNMWEIGQSL EIMADHTDYV CAMVYPSHYD RNELGFDNAD
AYPYEIVKDS IEKGQKRMEG KYAVQRPWLQ AFTATWLDPV TPYGRTEVRA QMQAVAEVEG
TYGWILWNAA NYYDPDWLD