Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3339 |
Symbol | |
ID | 5735209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4208713 |
End bp | 4210575 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280486 |
Product | hypothetical protein |
Protein accession | YP_001546103 |
Protein GI | 159899856 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGCT GGCGAATCTG CCTTTTGCTG TGGATGTTGG TGGGTTGTAG CGCAACCGCA CCGACAACCA GCACCCAAAT TCCAATTCAA CCAAGCTTGC CAACTCCAGC TCAAACTCTC AGCCCGACTC AAGCATTGGC CACAAGCATC CCAATAATTG CCGATGAAGC GGCGGTGGAA CGAGCCAGCC AACAAGGGCT TGAGCGCGAT TTAGCCCAAC TTGCCGTTGA TTGGCGCTTG ATTAGCGAAA AACCTCAACC ACTGCGTTTG GAAATGCCCC CACCATACGA GCGCCGTAGC TTTTGGGTTA CCGATTTAAC CAGCAATCAA CAGCGCAATA TCAGTGCCAC CCTTCAACTC AGCACCACCC ACTTATTAAT TTATGTCGCC GATGATTTGC CGGTTGAGCA ACAAGCGCTG ATTAATGCTG CTCAGCAATT TGAACAGGTT GGTTGGCCGT TGCTAGCCAA ATGGTATCCG CAACAGGCTT GGCCCCAAGT ACCTGTAACC GTCTTGAATG CTGCGGTCAA CGGGGCGGGC GGCTATTACG CCAGCGATAA CGAATTACCC CAAGCGATCA ATCCATATTC CAACGAACGC GAGATGTTGG TGATTAACGC TGCGGCCATG CCACCCAGCG ATTTTGGCTA TGTCGCCACG TTAATTCACG AAATGCAGCA TCTGTTGCAT CGGAATGTGC TGAGCCACCC CGCCACTTGG CTCAACGAAG GCGCTTCGAT GTTGAGCGAA GATCGTTCAG GCTATAGCAA CGATAGCTTG GCACTCGATT TTCTGGCCTC GCCGGATACC CAACTCAATG CGTGGGCCAG CAGCCCTGGC ACTGCGCTCA AACATTATGG CGCGGCTCAG CTGTTCCTCA GTTACCTTGA TCAGCAACTC GACGGCTTGC CGATGGGAAC CTTGGCGGCG GCTGATGCTG GCGATAATTT GACCAGCATT ACCAGTTTGA TGACCACCCG CTATCCCGAT TTAACCAGCT TTGATCAGCT GTTTGCGGCG TGGGCAGTCG CCAATTGGGT GAATGATCCA ACGGTGGCTG ATGGCCGTTA TGGCTACGAT CTGCCCCGCG CTGTGTTGCC AGAGCAGGCC CAGAGCAGCG AACAAAACCT GAGCATTCGG CAATTTGGCA GCGATTATTT GGCCTTTGAG AACGCCAGTA GCGAACGAAC GCTCGAATGG CAGGGCAACA ACACAGTGCC TATTTTCGCC GCCGATGTGA CAAGTAGCGC CACATGGTGG AGCGGGCGTG GCGATGCGCG GGTCAGCACG CTCACCACAG CAATTCAAGT GCCTAGCGCG GGCGGCAGCC TGATTTATCG ACGGTGGTTT GATTTAGAGC AAGATTACGA TTATGCCTAT CTCAGTCTTT CGCAAGATAA CGGCCAAACC TGGCAAGCAA TTGCGACCCA AGCCAGTACT GGAGCCAATC CGGTTGGCTT GAATATTGGG GCTGGCTGGA CAGGCCAACA AACCACGTGG CAAGCAGAAA GCGTTGATCT CACGCCGTGG GCAGGCCAAC AGATTCAATT GCGATTTTGG GTGATCAACG ATGAAGCGTA TAATGCTGCT GGTTTAGCCT TGAGCGATCT GACAATCGAT GGGGTAACGG CTGAATGGGT TGGGACTGGC TTTGTGCCAG TTCGTAATCA ATTGGCACAG CGTTGGGTGC TCACGGCGGT GCTCTATGAT CAAGCTGGGG TTGCGGAAGT TGTCTCAATT CCAACCGATA ATGGCCAAGC GCGTTGGCTG ATTCCGGCCA ATCGGCGAGC GGTTTTGGTG GTCAATGCCA CAACTCAAGG CACCACCGAA GCAGCCAATT ACAGCTATAA CGTCACACCG TAG
|
Protein sequence | MRRWRICLLL WMLVGCSATA PTTSTQIPIQ PSLPTPAQTL SPTQALATSI PIIADEAAVE RASQQGLERD LAQLAVDWRL ISEKPQPLRL EMPPPYERRS FWVTDLTSNQ QRNISATLQL STTHLLIYVA DDLPVEQQAL INAAQQFEQV GWPLLAKWYP QQAWPQVPVT VLNAAVNGAG GYYASDNELP QAINPYSNER EMLVINAAAM PPSDFGYVAT LIHEMQHLLH RNVLSHPATW LNEGASMLSE DRSGYSNDSL ALDFLASPDT QLNAWASSPG TALKHYGAAQ LFLSYLDQQL DGLPMGTLAA ADAGDNLTSI TSLMTTRYPD LTSFDQLFAA WAVANWVNDP TVADGRYGYD LPRAVLPEQA QSSEQNLSIR QFGSDYLAFE NASSERTLEW QGNNTVPIFA ADVTSSATWW SGRGDARVST LTTAIQVPSA GGSLIYRRWF DLEQDYDYAY LSLSQDNGQT WQAIATQAST GANPVGLNIG AGWTGQQTTW QAESVDLTPW AGQQIQLRFW VINDEAYNAA GLALSDLTID GVTAEWVGTG FVPVRNQLAQ RWVLTAVLYD QAGVAEVVSI PTDNGQARWL IPANRRAVLV VNATTQGTTE AANYSYNVTP
|
| |