Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4678 |
Symbol | |
ID | 5736525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5975906 |
End bp | 5976922 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281842 |
Product | hypothetical protein |
Protein accession | YP_001547437 |
Protein GI | 159901190 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTGTCATGCT GATTTTAGGG TTATTGCTGG GGTTAACCCT TTGGCCCAGA GCGGCCTCAG CTCATGAAGT CTGCTTCCCC GAAGATCAAA GCCCTCACTG TTTGAGTGAC CCATTTAGCG ATTATTGGGA AGCTAACGGC GGCCTGCCCG TTTTTGGTTA TCCAATTACC ACGGTCAACC GCGAACAAAA TCGCGAAACT GGCAAAGTCT ATCGGACCCA ATGGATGGAA CGCAATCGTT TTGAGTTGCA TCCGGAAAAT ACGGGCACGC CTTATGAGGT TTTGCTGGGT TTATTGGGCA AAGATCGCTT GGCACAACTG GGCCGCCAAC CTGATTCAGC AGAATCTGGG CCAAAGGCTG GCTGTTTGTG GTTCAAAGAA ACTGGGCACA ATGTCTGTGA TCAAGGCAAT GGGATTGGCT TCAAAAGCTA TTGGCAAGCC AATGGCTTGA AAATCGATGG ATTGGATGCC TATGCACGTT CGTTACAGTT GTTTGGTTTG CCCTTGACTG AGCCAAAAAT GGAGCGGAAC AGCAGTGGCG ATTTAGTCGA AACCCAATGG TTTGAGCGTG CTCGCTTCGA GTGGCATCCC AATAATCCCA ATAATTTTAA GGTGTTGTTG GGCTTGCTCG GCAAAGAGGT GCGCGGAACC ACTCCACCAA CCACGCCGCC AACCACTTCA CCGCCAACGA CCGCCAATTG CACCATGAAC GCGCCAGCAG TCGCCGAAGG CGCACAGGCT TGGGTGGTCT ATCCCGAAAT TGCTAGCAAT ACAACCCAAA CGGTCTGTGT GCGCTTGACG ATTAGTGCCA AGCCTAAGAG CGGGGCAACT GCCACCGTTG AAATCGATTT GGCCAGCGGA ACCCAATCGC TGATTGGCTT GACCGATAGT AGCGGCATTG CCACGATTCA GTTCAAAGTT GGCAGCCAAT CAAGTGGGCG GCGCAATGAT GTACGGGCCA GTTTCAGCAC AGGTCAAACT GCTACAACCA GCTTTATAAT CAAATAA
|
Protein sequence | MPKRVMLILG LLLGLTLWPR AASAHEVCFP EDQSPHCLSD PFSDYWEANG GLPVFGYPIT TVNREQNRET GKVYRTQWME RNRFELHPEN TGTPYEVLLG LLGKDRLAQL GRQPDSAESG PKAGCLWFKE TGHNVCDQGN GIGFKSYWQA NGLKIDGLDA YARSLQLFGL PLTEPKMERN SSGDLVETQW FERARFEWHP NNPNNFKVLL GLLGKEVRGT TPPTTPPTTS PPTTANCTMN APAVAEGAQA WVVYPEIASN TTQTVCVRLT ISAKPKSGAT ATVEIDLASG TQSLIGLTDS SGIATIQFKV GSQSSGRRND VRASFSTGQT ATTSFIIK
|
| |