Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2624 |
Symbol | |
ID | 5734502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3366360 |
End bp | 3367499 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641279764 |
Product | hypothetical protein |
Protein accession | YP_001545390 |
Protein GI | 159899143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000177581 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATCG CCGCATTATT GGCCGAGGCT CGCCAAGCCT TGAGTGTAGG AGATCGTGCC AAAGCCAAAC AGTTCGCTTC ACAAGCCTTG CGACTTGATA GCAAGCACAT TGAAACATGG CTTTTACTGG CCGATTTGGT TGAAGTTCCC GCGCAAAAAC GCGATTGTTT TCAACGAATT TTGGCCATCG ATCCTACCAA TGCACAAGCC AAACAAGCCC TGAACCAACT TGATGCACCA GCCCCAGTTG CTGCCGCCAG CTTTGAATTA CCCAAATCGG CGTGGTTTGG CAGCAAATCA GCTGAGCCAG AACTAACCGC CCAAGTTGGT GGGCCGATCG CAACAGCACC ATTAGCAACC AACGCACCGA GTGGCATTCG ACCACTCGGT TCGAGCCAAG CTGCCTCATT AGTTGAATTG AGCAATACGG CTGTGGGTGG CACCACCCCA ACTGGCTATG TTCAGCCAAT TCAGCCTAGC TATAACCAAA CAAATTATCC ACAACCCAAC CAAGCCTATA GTTTGCCAGC AGCAAGTTAT GGCCAAGCCC AGCCACAATA TGGTCAAGTG ACCTACCCAT ATGTTGAGCC AAGTGGCGCA GCCAAATTCT TGAAATGGCT AGTTTTTGCA ACATTAGGCT TAATAGTAGT TTGTGCAGGG ATTGGCTTGA TGGTCAATTT TGGCAAAGAG CCACCGCCAA CGCCCAAAGT CAGTGCCCAA GATCGCACGA CAGCGATGCT TGGTGATACC ATGAAGATAA TCGTTAATCC CGATTTTTAT GATAAAAGCA AACAGAAAGC CATTGTTGAG CCTTATTTGA ATAACTATTT TATTGAGGCT GCCCGTACTA ATACTGGACG GTTAGACCAA ACTGTTCTTG ATAGCTTCGA TAATGGAGTA GATCAACAGG CCTTTCTGGC GGCTCTACCT TATATCAAAG ATGCCTATGT TACGCCAACC CAGTATACGG TAGAATCACA AACGAATGAG ATGATTACAC TCAAGTTAAT CAGTGGAGAT CTTGTATTTA CTTTTCATAA TGGTAAATTT CTCCAAGTTC CATTAAAAGA TACCTACGAT ACATTTACCT GGGTCAATGA TGATGGAAGA TGGTATCTTG CTGGGATAAG TTTAAAATAA
|
Protein sequence | MQIAALLAEA RQALSVGDRA KAKQFASQAL RLDSKHIETW LLLADLVEVP AQKRDCFQRI LAIDPTNAQA KQALNQLDAP APVAAASFEL PKSAWFGSKS AEPELTAQVG GPIATAPLAT NAPSGIRPLG SSQAASLVEL SNTAVGGTTP TGYVQPIQPS YNQTNYPQPN QAYSLPAASY GQAQPQYGQV TYPYVEPSGA AKFLKWLVFA TLGLIVVCAG IGLMVNFGKE PPPTPKVSAQ DRTTAMLGDT MKIIVNPDFY DKSKQKAIVE PYLNNYFIEA ARTNTGRLDQ TVLDSFDNGV DQQAFLAALP YIKDAYVTPT QYTVESQTNE MITLKLISGD LVFTFHNGKF LQVPLKDTYD TFTWVNDDGR WYLAGISLK
|
| |