Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3589 |
Symbol | |
ID | 5735450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4517718 |
End bp | 4518659 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280738 |
Product | hypothetical protein |
Protein accession | YP_001546353 |
Protein GI | 159900106 |
COG category | [S] Function unknown |
COG ID | [COG2339] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCTTG CAATTGTTTT GACGGTATTG CCAACGCTGC TGTACGTGAC CTTTTTGTGG TGGCTTGATC GCTATGAAAA AGAACCATGG CCGCTGCTGT TAGCAGTGTT TGTATGGGGT GCGGTGCCTG CCGTGGCCCT CGCGTTGTTG GCTGAATTGC CCAATACTGC TTCAACTTTA GGTTTTGACC GTGGCAGCCA ACCAGTTTGG TATGCTCCGT TGGTCGAAGA ACCACTCAAA GCCTTGGCAT TGTTTGGCAT TTATATGTTC TTTCGCTATG AGTTCGATGG CGTGCTCGAT GGCATTATTT ATGGCGCACT GATTGGTTTT GGCTTTGCTA TGACCGAAAA TGCGATTTAT TTTGCTAGTC GTGGCGGCTC GATTACTTCA TTACTGTGGC TGCGTGTGGT GCTATTTGGC TTTAATCATG CCTTCTACAC CAGCATCATT GGGGTGGCGT TGGGCATGAT TCGCTACGAG CGCCGCCGTT GGGTGGTGGT TGTGATGCAG CCATTGAGCC TGATTTTGGC TGTCGTGTTC CACGCACTGC ACAATATGAC GATTAGCTAT CGCTTTCCTG GTGTATTGAT CGCATGGTTG ATCAGCAGCG GCGGCGTGTT AATTGTGGTG TTGGTGGCAA TCATCGCCTG GCGTAAAGAG CGCTATTGGA TCGATCTAGA ATTGGTTGAG GAAATTCGCT GTGGATTTAT CGACCAAGCG ACCTACCAAA CCGTGCGGTC AACCCGCCGT CGGGTGCAAA GCCAATGGGA AGCGCTCTTT CGCGGCGGCT GGCGAGCAGT GCAAACGGTG CGTACTCGCC ATCATTTACT CACTGAGTTG GCATTCTTGA AATATCAGCT GCGGATCGGC GATGTGCATT GTCGTCCCGC CGATTTATAC CCGCTGCGCC GCCGAATTCT CGAAGTTCAC GAAGATTTCT AG
|
Protein sequence | MWLAIVLTVL PTLLYVTFLW WLDRYEKEPW PLLLAVFVWG AVPAVALALL AELPNTASTL GFDRGSQPVW YAPLVEEPLK ALALFGIYMF FRYEFDGVLD GIIYGALIGF GFAMTENAIY FASRGGSITS LLWLRVVLFG FNHAFYTSII GVALGMIRYE RRRWVVVVMQ PLSLILAVVF HALHNMTISY RFPGVLIAWL ISSGGVLIVV LVAIIAWRKE RYWIDLELVE EIRCGFIDQA TYQTVRSTRR RVQSQWEALF RGGWRAVQTV RTRHHLLTEL AFLKYQLRIG DVHCRPADLY PLRRRILEVH EDF
|
| |