Gene Haur_0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0889 
Symbol 
ID5732790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1015172 
End bp1016947 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content50% 
IMG OID641278021 
Producthypothetical protein 
Protein accessionYP_001543665 
Protein GI159897418 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGAT GGCTCTATCG CTTACATCTA TGGTTGGTTC TGCTGCTGTT GATTGCGGCC 
TGTACCCAAG TTGGCGAATC GGGCGGCAAC CAAACGCTTA GCTTACGTAC CCTAACCGGC
AATGCTGCGG TTTTTGGCAC GATTGAGTTG GCGATTGATA CCACTATCAC CGTCGCCAAT
CCTTACGATC CAAATCAAAT CGATCTGATG GTGAGTTTTA TCTCAGCAAC CGGCCAAATC
TATCGTGTGC CAGCCTTTTG GTATCAAGAT TTTGATCAAC TTTCGCTGCA ACCCAAAGGC
AACCCTGAGT GGCGGGTGCG TTTCACGCCG AGCGAACCAG GTGCATGGCA AGTCAAGGCC
GAGCTAGCCA AGCCAGCGCT GAGCAGCGAC GTGATCACGA TTGAAGTTTC AGCGAATAAG
CAATCGCCAG GCTTTGTACG GATCAACACC AGCAATCCGC GCTATTTCGC TCGCCAAGAT
GGCACCTTCT TTATGCCAAT CGGCCTCAAT TTGGGCTGGT CAACCCAACA AGGCACGGGC
ATTTTGCGCG AATATGAACA CTGGTTTGAT CAATTAAGCA AAAACGGTGG CAATATTGCG
CGAATTTGGA TGGCCTCGTG GTCGTTTGGC ATCGAATGGC AAGATACCGG TTTAGGCGAT
TATTCCAAAC GCATGCAACA AGCATGGATG CTTGACCAAA TTTTCAAATT GGCCGAACAG
CGCAACATCA CAATTATGTT AACCCTTATC AACCATGGCG CATTTAGTAC CAGCACTGAT
TCAGAGTGGG CTAGTAATCC GTATAACGCT GCGAATGGCG GGCCAATTGC CGAGCCACGC
TTGTTTGCCA CCGATATTCA ATCGCGTGAA GTGTTCAAGC ATCGAGTGCG TTACATTGCG
GCTCGTTGGG CACATTCGCC TAGCCTATTC GCATGGGAAT GGTGGAACGA AGCCAACTGG
ACACCAATTA ATGATGCTTT GATGCAACCA TGGATCAGCG AAATGACCCG TCATTTGGCG
CAGTTTGATC CCTATCAACA TTTGGTTTCA ACCAGCTATG CCAGCAATAC CAGTACCTCG
ATGTGGGTAC AACCAGAGAT CAACTTCACC CAACACCACG ATTATACAGG CCGCGATTTA
GGACAAGCCT TCCCCTTGGT GATCCGTGAG TTGAATGCGG CAGCACCACA AAAACCAGCC
TTGGTCAGCG AACTTGGCTA TGCTGGCACT GGGCGCGACG AGGTAATCAA TCGGGATGTT
TGGCAGTTTC ATCAAGGCTT GTGGGCTGCA CCATTCAGTG GCTTTGCTGG CAGCGGCATG
TATTGGTGGT GGGACACCTT GGTCGATCCC GACAACTTGT GGAGCGAATA CAGCAAGTTG
GCCGAATTTT TCAAAGACCA AGATCTCACG ATCTACAACC CAGTTGTGGC TCAAATTTCG
CCGTTGAAGG CGCGGGCCTT AGCCTTACAA ACGAAATCGC AGGCGTTAGT CTGGGTGCGC
AGCAACGAAT ATGAGCCTGA AGCATTAACC AAAGCCTATG AAGAAGCGCT CAAAAAACGT
GAATTTAACG ATACATGGGA ATATGTACCG CCGACTTACG CCGATTTGAC GCTTAAGTTG
AATGGGCTAG AAGCCGGAAA CTACCAAGCA ACCTGGTACG ACCCGCAAAC TGGCACATGG
TCGCAACCAA CGACGGTAAC CCTTGAAGCT AACCAATCCA GTATTGCAGT TCCAAGCTTC
AACTACGATT TAGCCTTGAA ATTAGTCAAG CAATAA
 
Protein sequence
MRRWLYRLHL WLVLLLLIAA CTQVGESGGN QTLSLRTLTG NAAVFGTIEL AIDTTITVAN 
PYDPNQIDLM VSFISATGQI YRVPAFWYQD FDQLSLQPKG NPEWRVRFTP SEPGAWQVKA
ELAKPALSSD VITIEVSANK QSPGFVRINT SNPRYFARQD GTFFMPIGLN LGWSTQQGTG
ILREYEHWFD QLSKNGGNIA RIWMASWSFG IEWQDTGLGD YSKRMQQAWM LDQIFKLAEQ
RNITIMLTLI NHGAFSTSTD SEWASNPYNA ANGGPIAEPR LFATDIQSRE VFKHRVRYIA
ARWAHSPSLF AWEWWNEANW TPINDALMQP WISEMTRHLA QFDPYQHLVS TSYASNTSTS
MWVQPEINFT QHHDYTGRDL GQAFPLVIRE LNAAAPQKPA LVSELGYAGT GRDEVINRDV
WQFHQGLWAA PFSGFAGSGM YWWWDTLVDP DNLWSEYSKL AEFFKDQDLT IYNPVVAQIS
PLKARALALQ TKSQALVWVR SNEYEPEALT KAYEEALKKR EFNDTWEYVP PTYADLTLKL
NGLEAGNYQA TWYDPQTGTW SQPTTVTLEA NQSSIAVPSF NYDLALKLVK Q