Gene Haur_4999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4999 
Symbol 
ID5737191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp
End bp2412 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content62% 
IMG OID641282166 
Producthypothetical protein 
Protein accessionYP_001547757 
Protein GI159901511 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.664639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCTC CACGGGAACC GCACGCATGC GGCCATCCCG TAGGGGATGG TCGGCGCAGG 
GCGTTCTCGC TATTAACAGG AGTCCCTGCC ATGACCACCC CAACCACCAA ACCTGCCTTT
ACCACCATCG TTGCTGGCCC GATTCACCTT GTGGGCACGT CCGAGCAAGT CGCGGCCTAC
TACACCCATG GCACGGGCGA TTTGCCGCCA TGGTGGGCCG AATGGGATCA ACTCTTAGGC
GACACATGGG CGATGGATAT TCTGGAATCA CCCAGTGATG TCTCGCGCTT TATGCGGGCC
GCCGAAGTGC ATCCCACCCT GCCGCGTGTG GGCTTTATCT CGAACTCCAA ACTCAAATTG
GAGTCGGGCT ATGTCCTTGG CGCGGAGGAT CGCGACTATA AAAATACGGC CCAGCGCCTG
CACCGCTACT GGCAGTCCTT GCCGGAAGCC ATGCAACAGC GCTATGCCCG TTTTGGCTCG
CCTGCGACCA TTGCAGGCTT GCTCATGCGC CGTGAGGAGT GGGTCGATAG CCGCTGGGGC
AGCAGTGTCC ACGAGGCCGA TACCCGCAGC GCCTTGGCCA AACAGGCCAA AGCCACCACC
GAAGGCCGCA TTGCCGCCGA CGCATCCGCA CCCGACCGCC CCAACCAACG GGCGATCCCG
CTGATGCGCT ATGGTGGCCT TGCCTGCCCG CGCTGTGGCT GGCTGCAACG CAAAACCGAT
GGGGCCGTGC TTGATGCCAA GCAACTCAAG CAACGCGGCC TCGCCAGCGT GACCTGCCCC
CAGTGTCACG ACCACCTCGG CCAACAATGC CGCGAACGGG ACAATGTGCA GGATCGCAGC
CTCCCAATCT TCCAGTCTGA CGACTGGCAG ACCTACGCCG TCGATGCCAA CGGTCGTCGC
GCCATCCCGT GGGGCCAGCG TCCACGCTCG AATCCGCGGA TGGCCCTCGC CTCCTTCATC
CAGCGCCGCT ACCCCCAACG GGTCGATCTC TACGTCCATG ATGAGATCCA CGAGGCCAAA
GGGGCACGCA CCGCACTGGG CAATGCCTTT GGGGCCATGG TCGCTGCCAG CCGCACCACG
GTTGGCATGA CCGGAACCGC CTACGGCGGC ATGGCCTCGA CGCTCTACGA CCTCTTGCTC
CGCCTTGGCA ACACCGTCAT TCGGGATCGC TGGGGCTGGA ACAACCGCAG CGCCTTTGTG
CGCGACGTGG GGGTCGTCGA TGTGATGGAT AAGGAGATCA CGCGCGCTGC CACCGCTGGC
CATTACGACG GGAAAACCCG CACCAGCACC GAGGTTCAGG AACGCGCAGG CATTACCGCC
GACCTGATCA CCATCGTCCA AAACTGCACC TATACCGTCC TGCTCAAGGA CATGGGCTTT
CAGTTGCCCG ACTACCGTGA GGATGTCGTG CTCTTGAAAC TGCCCATGGA CATGCAGGCG
CAGTATCGGC AGATCGAGGA CGAGGGCAAA GCGATCATTG GCAATGGCGG CTACGATGCC
CTGTCGGCCT ACCTCCAAGC CACCTTGTCA TGGCCCTACC AGCCATGGCG ACCCAAAACC
ATCAGTTCGC AGCTGGTCAA CGAAACAGTC AGAACGCCCG AACTGCCTGC CGAGCGCATC
CTGCCCCACC ATACGTGGTT GGCCCAGTAT TGCGCCGCGC AGATTCAGCA AGGACGACGC
GTGTTGCTGT TTGCTGAGCA TACGGGCAAC GATGACATTG CCGTCGATCT GGCGGAAAAA
GTCACCGCCC TCGCTCACGA GCAGCACCAG ACCACGCTCA AGGTGGCCAT CCTCCGCGCC
ACCACCGTGG CTCCGGGGGA ACGCAATGCC TGGTTTACCG AACAGGTCAA CAACGGCGCG
AATGTCGTCG TGTGCAACCC GCGCTTGGTC AAAACTGGCC TCAACTTAAT CGCGTGGCCC
AGCATTGTGG TCGTCGAGCC GCTGTATAGC CTCTATGATC TCTTTCAGGC CAAACGCCGC
GCCTTCCGTC CCACTCAAAC CATGGGCTGT GAGGTGACCT TCTTGGGCTA CGAGCACACG
ATGAGTCACC GCGCCTTGGG GGTGGTGGGC CGCAAAGCCG CTGCCGCGAC CATGTTGAGT
GGCGATGACA GCGAGGGCGG CATGCTGGAA TTCGATCCGG GCATGAGTTT GCTCCAAGAG
TTGGCCAAGC AATTACGCAA CGCCAACCCC ATGGACGATG CCCGTGCCCT GCGCGATCAA
TTTCGCAGCG TCGGACAGAC CCTCAAGGCC GAGGCCGAAC GCCCCAGCCT GATCCTGCCG
ACTGACCCAA CCCCACCACC CAGCGACACC GCCGCGCAGC CCATCCGTTG GGACGAGCTG
TTCACGGTCG TCGATGTGCC GACCCGCCAC ACGGCGCAGG TCGCCCACCA GTTCGCTATG
GTGCTGTAA
 
Protein sequence
MNSPREPHAC GHPVGDGRRR AFSLLTGVPA MTTPTTKPAF TTIVAGPIHL VGTSEQVAAY 
YTHGTGDLPP WWAEWDQLLG DTWAMDILES PSDVSRFMRA AEVHPTLPRV GFISNSKLKL
ESGYVLGAED RDYKNTAQRL HRYWQSLPEA MQQRYARFGS PATIAGLLMR REEWVDSRWG
SSVHEADTRS ALAKQAKATT EGRIAADASA PDRPNQRAIP LMRYGGLACP RCGWLQRKTD
GAVLDAKQLK QRGLASVTCP QCHDHLGQQC RERDNVQDRS LPIFQSDDWQ TYAVDANGRR
AIPWGQRPRS NPRMALASFI QRRYPQRVDL YVHDEIHEAK GARTALGNAF GAMVAASRTT
VGMTGTAYGG MASTLYDLLL RLGNTVIRDR WGWNNRSAFV RDVGVVDVMD KEITRAATAG
HYDGKTRTST EVQERAGITA DLITIVQNCT YTVLLKDMGF QLPDYREDVV LLKLPMDMQA
QYRQIEDEGK AIIGNGGYDA LSAYLQATLS WPYQPWRPKT ISSQLVNETV RTPELPAERI
LPHHTWLAQY CAAQIQQGRR VLLFAEHTGN DDIAVDLAEK VTALAHEQHQ TTLKVAILRA
TTVAPGERNA WFTEQVNNGA NVVVCNPRLV KTGLNLIAWP SIVVVEPLYS LYDLFQAKRR
AFRPTQTMGC EVTFLGYEHT MSHRALGVVG RKAAAATMLS GDDSEGGMLE FDPGMSLLQE
LAKQLRNANP MDDARALRDQ FRSVGQTLKA EAERPSLILP TDPTPPPSDT AAQPIRWDEL
FTVVDVPTRH TAQVAHQFAM VL