Gene Information |
Locus tag | Haur_4999 |
Symbol | |
ID | 5737191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 4 |
End bp | 2412 |
Gene Length | 2409 bp |
Protein Length | 802 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641282166 |
Product | hypothetical protein |
Protein accession | YP_001547757 |
Protein GI | 159901511 |
COG category | |
COG ID | |
TIGRFAM ID | |
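
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustrative sanity check, not part of the database record. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names `cds_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4
end_bp = 2412
gene_length_bp = 2409
protein_length_aa = 802


def cds_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


computed_gene = cds_length(start_bp, end_bp)
computed_protein = protein_length(computed_gene)

# Compare the derived values with the ones listed in the record.
print(computed_gene == gene_length_bp, computed_protein == protein_length_aa)
```

Under those assumptions, 2412 - 4 + 1 gives 2409 bp, and 2409 / 3 gives 803 codons, of which the last is the stop, leaving 802 residues.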
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.664639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCTC CACGGGAACC GCACGCATGC GGCCATCCCG TAGGGGATGG TCGGCGCAGG GCGTTCTCGC TATTAACAGG AGTCCCTGCC ATGACCACCC CAACCACCAA ACCTGCCTTT ACCACCATCG TTGCTGGCCC GATTCACCTT GTGGGCACGT CCGAGCAAGT CGCGGCCTAC TACACCCATG GCACGGGCGA TTTGCCGCCA TGGTGGGCCG AATGGGATCA ACTCTTAGGC GACACATGGG CGATGGATAT TCTGGAATCA CCCAGTGATG TCTCGCGCTT TATGCGGGCC GCCGAAGTGC ATCCCACCCT GCCGCGTGTG GGCTTTATCT CGAACTCCAA ACTCAAATTG GAGTCGGGCT ATGTCCTTGG CGCGGAGGAT CGCGACTATA AAAATACGGC CCAGCGCCTG CACCGCTACT GGCAGTCCTT GCCGGAAGCC ATGCAACAGC GCTATGCCCG TTTTGGCTCG CCTGCGACCA TTGCAGGCTT GCTCATGCGC CGTGAGGAGT GGGTCGATAG CCGCTGGGGC AGCAGTGTCC ACGAGGCCGA TACCCGCAGC GCCTTGGCCA AACAGGCCAA AGCCACCACC GAAGGCCGCA TTGCCGCCGA CGCATCCGCA CCCGACCGCC CCAACCAACG GGCGATCCCG CTGATGCGCT ATGGTGGCCT TGCCTGCCCG CGCTGTGGCT GGCTGCAACG CAAAACCGAT GGGGCCGTGC TTGATGCCAA GCAACTCAAG CAACGCGGCC TCGCCAGCGT GACCTGCCCC CAGTGTCACG ACCACCTCGG CCAACAATGC CGCGAACGGG ACAATGTGCA GGATCGCAGC CTCCCAATCT TCCAGTCTGA CGACTGGCAG ACCTACGCCG TCGATGCCAA CGGTCGTCGC GCCATCCCGT GGGGCCAGCG TCCACGCTCG AATCCGCGGA TGGCCCTCGC CTCCTTCATC CAGCGCCGCT ACCCCCAACG GGTCGATCTC TACGTCCATG ATGAGATCCA CGAGGCCAAA GGGGCACGCA CCGCACTGGG CAATGCCTTT GGGGCCATGG TCGCTGCCAG CCGCACCACG GTTGGCATGA CCGGAACCGC CTACGGCGGC ATGGCCTCGA CGCTCTACGA CCTCTTGCTC CGCCTTGGCA ACACCGTCAT TCGGGATCGC TGGGGCTGGA ACAACCGCAG CGCCTTTGTG CGCGACGTGG GGGTCGTCGA TGTGATGGAT AAGGAGATCA CGCGCGCTGC CACCGCTGGC CATTACGACG GGAAAACCCG CACCAGCACC GAGGTTCAGG AACGCGCAGG CATTACCGCC GACCTGATCA CCATCGTCCA AAACTGCACC TATACCGTCC TGCTCAAGGA CATGGGCTTT CAGTTGCCCG ACTACCGTGA GGATGTCGTG CTCTTGAAAC TGCCCATGGA CATGCAGGCG CAGTATCGGC AGATCGAGGA CGAGGGCAAA GCGATCATTG GCAATGGCGG CTACGATGCC CTGTCGGCCT ACCTCCAAGC CACCTTGTCA TGGCCCTACC AGCCATGGCG ACCCAAAACC ATCAGTTCGC AGCTGGTCAA CGAAACAGTC AGAACGCCCG AACTGCCTGC CGAGCGCATC CTGCCCCACC ATACGTGGTT GGCCCAGTAT TGCGCCGCGC AGATTCAGCA AGGACGACGC GTGTTGCTGT TTGCTGAGCA TACGGGCAAC GATGACATTG CCGTCGATCT GGCGGAAAAA GTCACCGCCC TCGCTCACGA GCAGCACCAG ACCACGCTCA AGGTGGCCAT CCTCCGCGCC 
ACCACCGTGG CTCCGGGGGA ACGCAATGCC TGGTTTACCG AACAGGTCAA CAACGGCGCG AATGTCGTCG TGTGCAACCC GCGCTTGGTC AAAACTGGCC TCAACTTAAT CGCGTGGCCC AGCATTGTGG TCGTCGAGCC GCTGTATAGC CTCTATGATC TCTTTCAGGC CAAACGCCGC GCCTTCCGTC CCACTCAAAC CATGGGCTGT GAGGTGACCT TCTTGGGCTA CGAGCACACG ATGAGTCACC GCGCCTTGGG GGTGGTGGGC CGCAAAGCCG CTGCCGCGAC CATGTTGAGT GGCGATGACA GCGAGGGCGG CATGCTGGAA TTCGATCCGG GCATGAGTTT GCTCCAAGAG TTGGCCAAGC AATTACGCAA CGCCAACCCC ATGGACGATG CCCGTGCCCT GCGCGATCAA TTTCGCAGCG TCGGACAGAC CCTCAAGGCC GAGGCCGAAC GCCCCAGCCT GATCCTGCCG ACTGACCCAA CCCCACCACC CAGCGACACC GCCGCGCAGC CCATCCGTTG GGACGAGCTG TTCACGGTCG TCGATGTGCC GACCCGCCAC ACGGCGCAGG TCGCCCACCA GTTCGCTATG GTGCTGTAA
|
Protein sequence | MNSPREPHAC GHPVGDGRRR AFSLLTGVPA MTTPTTKPAF TTIVAGPIHL VGTSEQVAAY YTHGTGDLPP WWAEWDQLLG DTWAMDILES PSDVSRFMRA AEVHPTLPRV GFISNSKLKL ESGYVLGAED RDYKNTAQRL HRYWQSLPEA MQQRYARFGS PATIAGLLMR REEWVDSRWG SSVHEADTRS ALAKQAKATT EGRIAADASA PDRPNQRAIP LMRYGGLACP RCGWLQRKTD GAVLDAKQLK QRGLASVTCP QCHDHLGQQC RERDNVQDRS LPIFQSDDWQ TYAVDANGRR AIPWGQRPRS NPRMALASFI QRRYPQRVDL YVHDEIHEAK GARTALGNAF GAMVAASRTT VGMTGTAYGG MASTLYDLLL RLGNTVIRDR WGWNNRSAFV RDVGVVDVMD KEITRAATAG HYDGKTRTST EVQERAGITA DLITIVQNCT YTVLLKDMGF QLPDYREDVV LLKLPMDMQA QYRQIEDEGK AIIGNGGYDA LSAYLQATLS WPYQPWRPKT ISSQLVNETV RTPELPAERI LPHHTWLAQY CAAQIQQGRR VLLFAEHTGN DDIAVDLAEK VTALAHEQHQ TTLKVAILRA TTVAPGERNA WFTEQVNNGA NVVVCNPRLV KTGLNLIAWP SIVVVEPLYS LYDLFQAKRR AFRPTQTMGC EVTFLGYEHT MSHRALGVVG RKAAAATMLS GDDSEGGMLE FDPGMSLLQE LAKQLRNANP MDDARALRDQ FRSVGQTLKA EAERPSLILP TDPTPPPSDT AAQPIRWDEL FTVVDVPTRH TAQVAHQFAM VL
|
| |