Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1702 |
Symbol | |
ID | 5733589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1979863 |
End bp | 1981395 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641278844 |
Product | hypothetical protein |
Protein accession | YP_001544473 |
Protein GI | 159898226 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTTT CGCCAACATT TGCGGTACAG TTAAGCCAAC TCAAACAAAC AGCATCGATG TCAATCCGAG TCTTAAGCCA AAAAACCGCC ATCCCCGAAG ATACGCTCGA CGATTGGTTA AAAGGCAAAA GTCGGCCACG CAATTGGGAA CGGGTGATCA TCGTTGCGGC GGCACTCCAA GCCAATTATC AGCAAGCGAA CAACCTCTTA CAAGCCATCA AAACTTCACC ACTTGAACTA CTACCATGGC ATCAAATTGA ATATTTTATT CAGCCACGTG AACAACAGAT TGGCACAAAT GTGCATAAGC AAACCATTAA TCCAATTTTA GCAATGGTCC AAACATGGTA TGAGGCGCAA ATCCAATTGC TGCAACAGGA GGTAAAGGCT GATTCAGTTG ATCACGTAGA AGCACATAAT CCTCAATTAG CACCAACAAC TGAAGTTAAG CCAAATCCAA GTGATCCAAT TGCTATCATA CACACGGAAA TTCACGAGCA AACAAAACAT CAAAAACATA TAAATCTAAG TCTACCCAAA AAACGTTTGT TGTGGTTATT TGGCATTAGT AGCATTACGA TTATGCTCAT GGGGCTGAAT CTGCAACATG CGAAATCAAA CGATCAATCA GTAATAGACA TTCCATCGCA ACACAATTCA AATCTTACTA ACTCCATGAT TACAATTCCG GCTGGATTTT TCATTCAGGG AAGTAATTAT GCTGATATCG CATACTATGC TCAATTATGT ATTGATTATG GTGCTGCCTG TACAGAATTA GAGTTTGATG ATGAATTTGA TCAAAATGGC CAAGCGCGTC AGGTATTTCT GAATAGCTAC CGCATTGATA AATATGAAGT AACGAATGCT CAATTTGCCC AATTCGTTGA GCAAACTCAG TATATAACCT ACGCTGAACG CCAAGGAGAG AGCATGATTC TTGAGGTTAT CGAAACGGCT GGCTCTAAGG AAACACTGAA CTTTAGCGCG ATTAAAGGCG CTTTTTGGAA ACAACCATAC GGCCCCAATT CATCAATTGA CGACAAAGCC GATTATCCAG TCATTCATAT TCACTATGAA GATGCAGTGG CGTACTGCAC AGCCAAGCAT AAACGATTGC CCACCGAAGC CGAGTGGGAG AAAGCGGCGC GAGGCGTTGA AGGGTGGCGA TTTCCGTGGG GCAATGAGTG GAAGTCTGGC TTAAGCAATC ATGCGATTCC GCTGCGATCC CATATTTTAC AAGTTCGTGG TTTACAAGCA ATTGGCCAAT CTCCGCAAAG TATCAGCCCA TATGGGGTAC ACGACCTCTT AGGGAACGTA AGCGAGTGGA CTGCCGATTG GTATCAGCCA AGCTATTATC AAAATAATCC TGCTAGCCAA AACCCTCAAG GTCCCGAGCT AGGCAACAGC CATGTCAAGC GTGGAGGGAG TTGGGCAACA CCACCTGGCT ATCTGCATAA TAGTTGGCGG ATTGGCACTC CCGACCAAAC AACCGATCGC TTAGGCTTTC GCTGCGCCGC CGATGTAAAT TAA
|
Protein sequence | MTVSPTFAVQ LSQLKQTASM SIRVLSQKTA IPEDTLDDWL KGKSRPRNWE RVIIVAAALQ ANYQQANNLL QAIKTSPLEL LPWHQIEYFI QPREQQIGTN VHKQTINPIL AMVQTWYEAQ IQLLQQEVKA DSVDHVEAHN PQLAPTTEVK PNPSDPIAII HTEIHEQTKH QKHINLSLPK KRLLWLFGIS SITIMLMGLN LQHAKSNDQS VIDIPSQHNS NLTNSMITIP AGFFIQGSNY ADIAYYAQLC IDYGAACTEL EFDDEFDQNG QARQVFLNSY RIDKYEVTNA QFAQFVEQTQ YITYAERQGE SMILEVIETA GSKETLNFSA IKGAFWKQPY GPNSSIDDKA DYPVIHIHYE DAVAYCTAKH KRLPTEAEWE KAARGVEGWR FPWGNEWKSG LSNHAIPLRS HILQVRGLQA IGQSPQSISP YGVHDLLGNV SEWTADWYQP SYYQNNPASQ NPQGPELGNS HVKRGGSWAT PPGYLHNSWR IGTPDQTTDR LGFRCAADVN
|
| |