Gene Information |
Locus tag | Haur_3818 |
Symbol | |
ID | 5735682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4792284 |
End bp | 4794206 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280970 |
Product | hypothetical protein |
Protein accession | YP_001546582 |
Protein GI | 159900335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
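The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_3818.
length_bp = gene_length(4792284, 4794206)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1923 640
```

Both results match the Gene Length (1923 bp) and Protein Length (640 aa) fields in the record.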
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00672536 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTGG GTCGATATGC TCGGCCATTG CTGCTGAGTA GCGTGGTTGC TGGGCTATTG TTATGGCTTG GTTGGCAGCG TTTAGTGCCA TTCAACCTAG CAATTGGCGG CGATTTAGTC ATTGGCGAGT CGGGTGAGCA ATTTGCCTTG ATGTACGATC AGCCCTATCT TGAGCATGTG CATGCCCCCG AACCGGCCAC GGTTGATCTC ACCACCACTG AAACCTATCG CTGGACCCAA CCCGAATCAG CGATAACTGT ACCGTATCTC AATGGCAGTG CTCATATCGT TCGCTTGAGT TTAGCGCCGC CATCAGTGCC ACAAACCCCG TTTTTGCTGC AAGCCAATGG CGCAGCGGTT CGTACCGACC TGCCACCAGG CCAACGCACG TTGCATCTGT TCGCTCCTGC TAGTAGCGAT GGCAGTTTAA CCCTCGAATT ACAAGCCCCA ACCTACAATG CTAGCCCTGA CCCACGTTTG CTGGGAGTGG TGTTGTATCG GATGCAGGCT CAACCGTTAA GCCACGACTG GTTTCTGCCA TGGGCCGCTT GGATCGATTT GCTGCTCGTG GTAATTGTGG TTGGGCTTGG TGCTGCTTTG GCGGGTTTGG CTCCGTTGAC TGCGGCTGGC GCGATCTTGG TGACCAGCAG CGGATTAAGT GTTTTGTTGG CAACAGTGCG GACGGTGATT ACGCTCGATA CTGGACAGTT GCGCACAATA AGCTTGGCTT GTTTGCTGGT GGCTTGGCTT GGGCGTTGGT TGGCTGAGCG CCGCCAAAAC CCCGAAATCG CCTTAGTTGC TGGAATGACC GCCTTGGGTT TGGCGCTGCG CTTGATCGGC ATTCGTCATC CCCAAACCAA CTTCAGTGAC CTCTTGTTGA ATGTTAATAA TCTGGCGAGT GTGGGGAGCG GCGATTTACT ATTTACCGAA GGTTTGCCTT GTGCCGCAGG TGCTGGCCGT TCGCCCTATC CTCCAGGCAC GTATCTCATC ACGCAACCGT TAAGTTTATT GCTGCCCGCC AGCATGAATC GCGGCATTTT GATTCAGATA GTTGGTGCTG TTGCCGATGC GTTGGTGATT CCGTTATTGT GGTGGCTGAT TGATCGCACC CGTGACGCGC GAACGCCAGC ACGGGCGGCG TTATGGGCTG CCAGCCTCTA TCTTGCTCCC TTGGCGATGC TGCGGGCAAT GGTGATTGGT GAATGGAGCA ATGTGCTGGG CCAGGCAATT GCCATGCCAA TCTTGGCGTG GCTGGGATTA TGGCTTGCCA GCAACCAGCC GCGAGCATGG CAACCAGCCC TGATTGCTGG CTTGACGATC GCCGCCTTAC AGCATAGTGG CACAATGTTA TCGCTCGGGT TGTGGGGCGT GGCCTTGGCG GGATTCTTGG TTTGGCAAAA ACAATGGCAA GTGCTGGGTC GCTTGGTGGT GGTGGGGACA AGTGCCGTTG TGTTGGCGGT TGGCTTGTAT TACAGCAATT TCTTGGGCGA TCCAACCCTG GCCAATAATG GCGTGATTTG CCCAGCGCCA CGTCCGTTTG ACCAAAAATT GTGGGGCGTA GTTTGGAACG ATCTGATTGC GCTTGATGGG CGGGTTCCGG CATGGTTTTG GTTGGTGGGT TTAGGCGGTG CGTTTAGTTT GCGCCAAGGG TTATCACGGC TAGCAACTCC AATTTGGGCT TGGCTGGCAA CCTTCGTGCT TTCGCTTAGC TCGTTGCTTT GGTCGGAGCA AACTGTGCGT TGGTGGCTGT TTATCTTGCC CGCCTTAGCT TTGAGTGGTG GCGTAGGTTT AGCGACTTTA 
GCCCAGCGTG GCCGTTTCGG GCGGGTAGCT GCGATTGCCG CGAGTTTATT CATCATTGCT GCTTCGTTAG CCCTCTGGAC ACGCTTTATT ATCGAGTATC GCACGGGGGC GTTTGTTCCA TAA
|
Protein sequence | MALGRYARPL LLSSVVAGLL LWLGWQRLVP FNLAIGGDLV IGESGEQFAL MYDQPYLEHV HAPEPATVDL TTTETYRWTQ PESAITVPYL NGSAHIVRLS LAPPSVPQTP FLLQANGAAV RTDLPPGQRT LHLFAPASSD GSLTLELQAP TYNASPDPRL LGVVLYRMQA QPLSHDWFLP WAAWIDLLLV VIVVGLGAAL AGLAPLTAAG AILVTSSGLS VLLATVRTVI TLDTGQLRTI SLACLLVAWL GRWLAERRQN PEIALVAGMT ALGLALRLIG IRHPQTNFSD LLLNVNNLAS VGSGDLLFTE GLPCAAGAGR SPYPPGTYLI TQPLSLLLPA SMNRGILIQI VGAVADALVI PLLWWLIDRT RDARTPARAA LWAASLYLAP LAMLRAMVIG EWSNVLGQAI AMPILAWLGL WLASNQPRAW QPALIAGLTI AALQHSGTML SLGLWGVALA GFLVWQKQWQ VLGRLVVVGT SAVVLAVGLY YSNFLGDPTL ANNGVICPAP RPFDQKLWGV VWNDLIALDG RVPAWFWLVG LGGAFSLRQG LSRLATPIWA WLATFVLSLS SLLWSEQTVR WWLFILPALA LSGGVGLATL AQRGRFGRVA AIAASLFIIA ASLALWTRFI IEYRTGAFVP
|
| |
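The sequences above are displayed in 10-character groups separated by spaces. The following is a minimal sketch, assuming that grouping is purely cosmetic, of how a copied sequence could be normalised and given a basic sanity check before further use. The helper names are hypothetical; the stop codons are those of translation table 11 (TAA, TAG, TGA). The string in the example is a toy fragment, not the full gene.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (group separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    dna = normalize(seq)
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the normalised sequence."""
    dna = normalize(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


toy = "ATGGCGTTGG GTTAA"  # toy fragment in the same grouped layout
print(normalize(toy))       # prints: ATGGCGTTGGGTTAA
print(ends_with_stop(toy))  # prints: True
```

Applied to the full gene sequence, the same helpers could be used to compare the normalised length and GC percentage against the Gene Length and GC content fields.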