Gene Information |
Locus tag | Haur_4127 |
Symbol | |
ID | 5735988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5274315 |
End bp | 5275685 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281281 |
Product | hypothetical protein |
Protein accession | YP_001546887 |
Protein GI | 159900640 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
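The coordinate and length fields in the Gene Information table can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5274315
end_bp = 5275685

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1371 bp) and Protein Length (456 aa).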
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGAT TGTTATGTTT GCTGATCATT CTGCTGCTCT GGCCGCATCC TGTGGCTGCT CAAACCAATC TTCCTCAATG GCTTGAACGC AAAACTACCT ATATTTCTAT TTTGTATCCC CAAGGCAGCG AAGCCGAAGC CGAGCGCTAT GCTGGCTATA GCGACGCGAT CTACGAGGAA GTTACGGCGG TCGTTGGCTA TCGACCTGCC CCACCCTTGA CGCTGCGCAT CTACCCTACC AAAGAACTCT ATCAACAGGT AAACCCGGCG GCCCGTTGGC TGGAAGGCAT CGTGGCTCAT GCCCACACTG GGCGACGCGA AATTAGCATT GCTGTGCAGC AAACTGTAGG CATGAGCGAT GAAGAATTAC GCAATAATGT GCGCCATGAA TTAATGCATA TCATTGCTGC CGAGCTTTCC GATGGCCGAT TAAGTACGAT GTGGCAGGAA GGCATTGCCC AATATGTTGA AGTGCCAACC AGCCAAAGCG GCTATAAAAT TGCCTTGCTC AAACAAGCCC TCGAGAATAA CGCGCTTGCG ACCTGGCGCT TATTAGATAG TGCTGGCGCA GTCTACGATA ATCCAGAACT GGGCTATCCA CAAAGCTGGT CGATGGTCTC ATTTTTGATT CAGCGTTATG GTATGGCACG ATTTTTGGCT TTTTTAGAGG CGTTGCGCAC GGCTAGTGGC TATCGTTCAG CCCTCAGCCA AGCCTATAGT CTTAGTGCCG AAAGCCTTGA AAGTGAATGG CTGGCTCAAC TGCCAACCTG GATCGATGGT GGTTGGAAGC AAGCGCCGAG TGTCGCCTTC GATCAAGCGA GCATCGAAAC AGCTCTGGCT GCTGGACGTT ATAGCGAGGC CTTGACTGCC GCCGAAACCG CCCTGACGAT CAAGGATGAT CCGGCGATTG CGGCGCTGCG CGAACAAGCT CGCAAAGGCG TGCGAGCCGA AGATGCTGCT GCCGCCGCCC GCGTGGCGCT ATTGGAAGGC CGTTATGCTG AGGCCAAAAC CGCGATCGAA CAAGCGTTGC CATTATTTGC TGATTTGGCG CGAATTGATC GCCAAAAGCT GCTGAATGAT TACCTACAAC GCGCTGACCA AGGCCTCAAA GCCCAGCAAC TACTCGAAAC CGCCCGCCGC GATTTGAATG GAATTCGCAT AGTTGCTGCG CGTAATAATA TTGAGCAAGC TGCTAATTTA TTTAGCCAGC TTGGCGATAA TAATGGGCGC AGCCAAGCCG CTCAGTTGCT TGAATCGCTT AATCTACGGC TAAAAATTGT GGGAATTGGC TTGATTGTGG TGGTTGGGTT GGGTTTAGCG TGGAATATTG ATCGGCGACG AGCTATGCGC AAGCGGATGT TGCCGCTGTA G
|
Protein sequence | MRRLLCLLII LLLWPHPVAA QTNLPQWLER KTTYISILYP QGSEAEAERY AGYSDAIYEE VTAVVGYRPA PPLTLRIYPT KELYQQVNPA ARWLEGIVAH AHTGRREISI AVQQTVGMSD EELRNNVRHE LMHIIAAELS DGRLSTMWQE GIAQYVEVPT SQSGYKIALL KQALENNALA TWRLLDSAGA VYDNPELGYP QSWSMVSFLI QRYGMARFLA FLEALRTASG YRSALSQAYS LSAESLESEW LAQLPTWIDG GWKQAPSVAF DQASIETALA AGRYSEALTA AETALTIKDD PAIAALREQA RKGVRAEDAA AAARVALLEG RYAEAKTAIE QALPLFADLA RIDRQKLLND YLQRADQGLK AQQLLETARR DLNGIRIVAA RNNIEQAANL FSQLGDNNGR SQAAQLLESL NLRLKIVGIG LIVVVGLGLA WNIDRRRAMR KRMLPL
|
| |
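The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display string could be joined and its GC percentage computed is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example = "ATGCGCCGAT TGTTATGTTT"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```

Passing the full Gene sequence field through the same two helpers would give a value to compare against the GC content listed in the record.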