Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2023 |
Symbol | |
ID | 5733912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2514968 |
End bp | 2516194 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279167 |
Product | hypothetical protein |
Protein accession | YP_001544794 |
Protein GI | 159898547 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCT TAGTGCTCAA TCGTCAAAAG CCTCACTTAG CTCCGTTTGG CGATTGGCTT GGCGATTTAG TGCCACAGGC CCGCTTATTT ACGGCTGCCA ACCGTGTGCA GGGCTTTCAA GGGTTTGCGG CGATTCAGCC ATTTGAGAAC TATGAAGACA GTGGCCTGAT TGAATTTGAG GCTTTACGGC TGCATCGTCA ATCGCCAATC GAGCGAATTG TTGCAACTTC AGAGGTCGAT ATTCTGCGTG CAGGCCGCTT ACGTAGCTAT CTTGGGTTGC CAGGCCAACA AGCCGATAGT GCCTTGGCCT TTCGCAATAA AGTTGTGATG AAGCAACACC TGGTTAATCG CACTCAGCTG GTCAATATCC CAATCTTTCA GGCGATCAAC GAGCCGTTCG ATATCATTCA ATTTATCGAA CAGCATGGCT ACCCAGTAAT CGTCAAACCA GATGATGGCA GTGGCTCGCT GGGGGCAAAA ATGCTGGCAA ACGAGGATGA TCTGGCCCAG TTTTTACAAC AGCCGCTGCC CCGTGGTTTA GAAATTGAGT GCTTTATCCA AGGCGATCAA TATCATGTCG ATGGATTATT GGTCGATAAC GAGGTCTGTT TCTGCTGGCC ATCGCAATAT CTTGGCAATG GTTTATCCTT TACCCAAGGC TGGTTTACTG CGAGCCAGAT GCTTCGGCCC GAACATCACT TGACCCAGCG CTTAATCGCA GCGGCCAAAG AAGTGTTGGC TTTGCTGCCA ACTCCACCCG TCACCAGCTT TCACCTTGAG TTGTTTCATA CTCCTGGCGA TGAGCTGTTC TTTTGCGAAA TTGCCAGCCG CACTGGTGGC GGTATGATCA ACGGAACAAT TGAGCAGGCA TTTGGGATTA ATCTAAATCA ACTCTTTATC CAAGGCCAAG CTGGCATGCC GATTGATACG AGCCGATTAA GGGCGATCAC CCAACCCAAG AAGATTGTCG GTTGGGGGTT GGTTCCACCG CAAGCTGGGG TTTTTCGCGG CTATCGCCAA GCAAAACCAC CCCAACCATG GGTCCTCCAC TTCGATTGGA GCATTCAGGC AGGTACGCAC TCACAACCAG CGCAAATGAG TGTTGATCAA GTCGGCGGGT TTATTGTTGA TCTGACTGAT GCTCCTAACC CCGAAGAACG CTTGATTGAG GTTTGGCGCT GGGCCGAGCA CCAAGCACTG TGGGAGCCAG CAGGAGTAAA TGCATGA
|
Protein sequence | MKILVLNRQK PHLAPFGDWL GDLVPQARLF TAANRVQGFQ GFAAIQPFEN YEDSGLIEFE ALRLHRQSPI ERIVATSEVD ILRAGRLRSY LGLPGQQADS ALAFRNKVVM KQHLVNRTQL VNIPIFQAIN EPFDIIQFIE QHGYPVIVKP DDGSGSLGAK MLANEDDLAQ FLQQPLPRGL EIECFIQGDQ YHVDGLLVDN EVCFCWPSQY LGNGLSFTQG WFTASQMLRP EHHLTQRLIA AAKEVLALLP TPPVTSFHLE LFHTPGDELF FCEIASRTGG GMINGTIEQA FGINLNQLFI QGQAGMPIDT SRLRAITQPK KIVGWGLVPP QAGVFRGYRQ AKPPQPWVLH FDWSIQAGTH SQPAQMSVDQ VGGFIVDLTD APNPEERLIE VWRWAEHQAL WEPAGVNA
|
| |