Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2543 |
Symbol | |
ID | 5734421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3264814 |
End bp | 3266379 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 641279683 |
Product | hypothetical protein |
Protein accession | YP_001545309 |
Protein GI | 159899062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000588775 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATGA ACGAAATTTG GACAACCCAA GATCAATATT ATACATATAT TAAGAAAATT GCTGATAATA TTTTTAAGGA CTCAAATATT ATTTATGAAA TATTGATATT TGGTGTTAAG TATCATGATA GAAATCAATG TTCAGTAATA TTATACCCAG ATAATAAAAA TAATATTATT TTATCTGAAA TGATAAAAAA GAAATTAATG TCTAGTAACA AGGGAATTAA AAAAGGAGTA ATCCCTGAAA AACGTAGTAC AAATCTTGAT TCAAATGATA TTTCGCATAG AGCTATGATA AATTCATTTT TAAATGCTAA GAAGAAATTT AATTTACAAA ATAAAAATAT AAAATATAGA AGAAAATATA ATTTAGATAC ATATAAATTA TTTAATAAAA ATAATAAATG TTTTGCATAT TTCTCGCAAG TCAGGGATAT TTTAGGTTAC GATGTTTATA TAATATTTAA TCTAGATAAA AAATCATCAT CAAATTATCA TAAATTTCAT GACCCTAGGG AAGTTGGGCG AGGTGAAATA CGATCATTTA TTGATGCAGT TATCTATTCA TACCTTAATA AATGCGTTTC CGTTTTAGAA TCATTATTTA AAGACGAAAT AGTTTATAAA AATAATGAAA TAATATCAAT TTCAAATAAG ATACTAATCT CATCAGGGGA ATCTTTATTG AAAGGGGTTA TTGAAATATT AGATATCGAT AAGAAGGATG ATGAAGTACA AATAGATATG TTTGATTTGA TTAATTCTAT ATCACAAATG AGCTATGAAA ATACGTATAT TAATGGATAT TTAGTATTGA TTAATGATGA TAAATTAAAG AATATGCAAT ATATATTTAA GTTAAAAGAT CATAAAGATT ATAGAATTAA TAAAAGTAAA ACCACGAGAA AATTAATTCA AATGGCAAGA AATAATATTG TATTAGCTAC AGATTCAATA TTCATCTATG GATTGTATTA TAAAAACATA ATTGAGCAAT TAAAAGATGT ATTCATTATA AAAATTATTG GACAAAGTAA ATGGGAATTG TATTATAATA AATGGATGTT GATGTATGTT GAATATAGTA TACCAAGGAT AATAACAGGA ATTAATCAGG ATGAAATTCG TAATCGTATT ATCAACGCTG AATTTAGGCA AGAATACATA GATGATATAC ATAGATTTAT TTTAGATGTC ATAGAAGTTG CACTAAAAGA ACGAAAAGGG ATTACATTAA TTTTTGGTAA CGATGTACAA GAAGAGATCG CTAATAATAA TATTGTTTGT ATTAAAATTG AAGTTAGGTC AATTAGAGAT ATTTTTAAAT CAATAAACTC TCTACTTACT ATCGATGGAG GAATATTAAT GGATAAAGAA GGTCGATGTT ATGCAATTGG GGCAATTTTA CAATATGGCT CTTCTGAACA AAATGATCCG GCTCGCGGAT CACGCTATCA TGCTGCAAAA GCTTATTCTG AGAAGGCGAA AGAGAAAAAT TCGCGCTTTG TAGTTGTTAT CTCAGAGGAT GGATATATTG ACTTTTTTCC TGAACCTGAA AAATAG
|
Protein sequence | MSMNEIWTTQ DQYYTYIKKI ADNIFKDSNI IYEILIFGVK YHDRNQCSVI LYPDNKNNII LSEMIKKKLM SSNKGIKKGV IPEKRSTNLD SNDISHRAMI NSFLNAKKKF NLQNKNIKYR RKYNLDTYKL FNKNNKCFAY FSQVRDILGY DVYIIFNLDK KSSSNYHKFH DPREVGRGEI RSFIDAVIYS YLNKCVSVLE SLFKDEIVYK NNEIISISNK ILISSGESLL KGVIEILDID KKDDEVQIDM FDLINSISQM SYENTYINGY LVLINDDKLK NMQYIFKLKD HKDYRINKSK TTRKLIQMAR NNIVLATDSI FIYGLYYKNI IEQLKDVFII KIIGQSKWEL YYNKWMLMYV EYSIPRIITG INQDEIRNRI INAEFRQEYI DDIHRFILDV IEVALKERKG ITLIFGNDVQ EEIANNNIVC IKIEVRSIRD IFKSINSLLT IDGGILMDKE GRCYAIGAIL QYGSSEQNDP ARGSRYHAAK AYSEKAKEKN SRFVVVISED GYIDFFPEPE K
|
| |