Gene Information |
Locus tag | Haur_5068 |
Symbol | |
ID | 5737026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 82056 |
End bp | 83732 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641282233 |
Product | hypothetical protein |
Protein accession | YP_001547824 |
Protein GI | 159901578 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAA CCCACCCGAG CGGTGGGGCG GAGGTCATGC GCAATCCCGT ATGGCTAGCC GCACCCGACC GATCATATCT CTGGTTCCCG ACCAAACTTG CGGAAACCCT CGCCCATGCC CCGCTCGCGT TAGGCATCTA TGCCCTGATT GCGCGGCGCT GGCTCGCCCA GCATACGTCC GTTCCACTGT CTGCTCACGA TATTCAGGTC TATGACCCGT CACTGACCCG CGCCAAGATT CGCACGGCCT TCGACCAGCT TCTGGCTGGC CACTGGCTCG CCATTCCTAC GCCTCCGCAG CACGGCTGCA AAATCGCCTA TGTGCCCACG TGGGGATGGC AACGCGAAGG CGTGCGCCAG TGGGAACCCG CGCAATCATT CAATCGCGGG CGGGTGGCCA CCCATCGGCT CGACCGCACC CTGCTCGACT ACTACCTTGG CCGGATTGAG CCACGGTCGC ACGGCCAGCC ACTCATTACG CGGTATCTGA CCACGCCCGC CCTGACCTTG ACCGATGTGG GCATCTATAT GCTGCTCGTG GCCGGGATTC CGCATCGCCA TAGCACAGCA ACACTCCAAC ACCGTGGTTT ATGTCAACAG AATGTCCCGC TGGCCGTGCC AAGCATTGCG GAGATTCTGG CGCAGGGGAC GATGAGTCTC CACGGCGCAC AGCGTTTGCA GCTGCTGCCA CGCGGCGGAG TCATGCGGCC CAGCGACCCT GATCCGCGAC CCCCGCTTTT TTTTGTGGAA CCGGACCTGG CGACGATCAT GGTGATGACC ATGGCGACGA ACATGGCGAT GGCGGACAGC GTGTCCGAGG ACGGTTTTGA CGCGTCAGGA AGCAAAAAAA CGGCGGTTGC CCATGATTCA TGGAACGTCA CAGGCAGATT AAGCAAAATA GACAGAGAAG ATCAACCACC AGAAAACACT CTGCATAGCA TACAAAACGG TGGTGGAGTC TTTATTTCTG ATCGAACCAA CGAACGATCA GGGAACGAAA TCGACCAAGA TCTACCAGAA CCCGTTACCA GAACTATTCT GCGCCGTCGC AACATTATGT CAATTCCGGG GACCCCGCAG GTTGCATTGC TGCGCTCACT GGCCATTCGG CCCAAACAAC TTGCCGAGTT TGCCGATATC GATTTGGCCA CGCTGGAAAC GGTCGTGGCC GATGCCCGCC AGCGCACGGG CGTTCGCGAT ATCGGCGGGT GGGTCGTCAG TATCCTGCGG GATATTCAGG ATCACGGCTG GGAACCCGCT GCCGCCAAAT GGCAGATCGA TCAGCCCCGC GACTTCGAGG CCGCGCGCTC CCGGTGGCAC ACGGCCTTGG GCCTTGATGC ACCGCCCGAA GCGGAGTCCG AAGCCGCTGA CGACGCCTGC CGCGAATCCG CCCCGCCCCC CGTCGTGGTT GACTGGGTGG CGCTTGAACC GTTGCTTGAC ACCACGGATA CGGCTGTGTC GCTCGCCGCT CAGGAGCCGC CGCACCGTCC GTTGTGGATA CCCGCAGCCC TCTGGTTGCG GCTCCGTGCC TCGGTGCGGA TGCTGCTGAT CGCCAGTCGC TGCGATAATG GACGTATTAC GGCAGGGGAT GCCTGGCGAC AGGCGCGACT CGCGCTCCCT GCCTATCGCA CGGTGCTTCC GGCCTTCATA CACGCCTGCG AAGCCCTGCG GGAGTAA
|
Protein sequence | MPSTHPSGGA EVMRNPVWLA APDRSYLWFP TKLAETLAHA PLALGIYALI ARRWLAQHTS VPLSAHDIQV YDPSLTRAKI RTAFDQLLAG HWLAIPTPPQ HGCKIAYVPT WGWQREGVRQ WEPAQSFNRG RVATHRLDRT LLDYYLGRIE PRSHGQPLIT RYLTTPALTL TDVGIYMLLV AGIPHRHSTA TLQHRGLCQQ NVPLAVPSIA EILAQGTMSL HGAQRLQLLP RGGVMRPSDP DPRPPLFFVE PDLATIMVMT MATNMAMADS VSEDGFDASG SKKTAVAHDS WNVTGRLSKI DREDQPPENT LHSIQNGGGV FISDRTNERS GNEIDQDLPE PVTRTILRRR NIMSIPGTPQ VALLRSLAIR PKQLAEFADI DLATLETVVA DARQRTGVRD IGGWVVSILR DIQDHGWEPA AAKWQIDQPR DFEAARSRWH TALGLDAPPE AESEAADDAC RESAPPPVVV DWVALEPLLD TTDTAVSLAA QEPPHRPLWI PAALWLRLRA SVRMLLIASR CDNGRITAGD AWRQARLALP AYRTVLPAFI HACEALRE
|
| |
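Note on coordinates and lengths: the Gene Length and Protein Length fields above are consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive, and if the coding sequence includes its terminal stop codon (the gene sequence shown ends in TAA). Both readings are assumptions inferred from the numbers in this record, not a documented convention of the database. The function names below are hypothetical and serve only to illustrate the arithmetic.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus one for the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table of this record.
length = gene_length_bp(82056, 83732)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

The same check applies regardless of strand, since the length depends only on the two coordinates.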