Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4166 |
Symbol | |
ID | 5736027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5312684 |
End bp | 5313856 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281320 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546926 |
Protein GI | 159900679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA AATTCCGACA AGCAGAATTA CATTATCGAC GTGGATTATC ACTTGAACAT GCTGGGCGAA TTGCTGAAGC TGTAGAAGAG TATCGGCAGG CATTACAAGA AAATCCGCAG CTTCGGGCTG CTCATGTGGC CTTGGCTAAA TATTATTTGC GCAACGGGCT ATTGGCCAAG GCGGCTGATG CTTGGCATGC AGTCGTGGCA ATCGAGCCAG ATTACGAGGC ATTAACCAAT TATGCCACCG TTTTGATCGA GTTAAAGCAT TATCACGAGG CTCGCCAGAT TTTGCGTTTA TGTGTCGAGC TTTTTCCTTT AGATACCTTT GTAACCTATG AACTGGCCTA TATTGATTTT GCTGAGGGCT TATATCAACA AGCGCTCGAT CAATTGCTCG ATGTACGGCC AATTTATAAC GATGAATGGG AATTTCACGA ATTAATTGGG CGTTGCCAAA TTAAGTTGCA ATTGTATGAT GCTGCTTTAG CGAGTTTTGG TCGCGCCATT CTGTTGGTCG ATGATGATGA GCAAATTGAG CAACTACAAG ATCTCGGCAG TATTGCCCGC CGCTACCAAG AATTTAGCCT CGTTAGCAAT GAAAAAGATG CTTGGTATGC GACCCATGGC CTGATTTGCC TTGGTAGTAA CCTTGATAAT GGGCTGAATC TCAAGCCTCA GGCTGAGTTT GCTTGGTCGT TTGAGGCAAT TGCCACAACC TTGCAACGTG CCGCCGCCTT GGCCGAAGCC CATGTTTGGC GTTGCGACCA AGTTTTAGCC TTCGATAGCC AGAGCAAGCC CCTAGCCCAA GCGCTTGCTC AGTTATTACA ACGCCCTTAC ATCCAAAAAA TTGGTGACCC CGAAAAAATA ACCTTGGTGG TGATGGCTGA GTTTGAGCAA AGCGCCATGC TCGAAGCCAT CGCCGAACAG TTGGATGGGC TGCATTGTAT TTTTGCCTTG AGCATGCGCA CCAGCCCCGA TTTGCTCGAT GATATTCCCG ATCTGATTGG CCTGCCAATT CAAAAACCCA GTTTGCCCTG GACAAGCCAA TCGATCGCGA CTGCCACTAA AACCTTGCTG GCAACGCTGG CGCAACTCAC GCCAGAGCCA AATCGTGAGC AACAGCTTGA TTATTACCGC GAACAGCACC GTTTAATTAG ACTAAACGAC TAA
|
Protein sequence | MADKFRQAEL HYRRGLSLEH AGRIAEAVEE YRQALQENPQ LRAAHVALAK YYLRNGLLAK AADAWHAVVA IEPDYEALTN YATVLIELKH YHEARQILRL CVELFPLDTF VTYELAYIDF AEGLYQQALD QLLDVRPIYN DEWEFHELIG RCQIKLQLYD AALASFGRAI LLVDDDEQIE QLQDLGSIAR RYQEFSLVSN EKDAWYATHG LICLGSNLDN GLNLKPQAEF AWSFEAIATT LQRAAALAEA HVWRCDQVLA FDSQSKPLAQ ALAQLLQRPY IQKIGDPEKI TLVVMAEFEQ SAMLEAIAEQ LDGLHCIFAL SMRTSPDLLD DIPDLIGLPI QKPSLPWTSQ SIATATKTLL ATLAQLTPEP NREQQLDYYR EQHRLIRLND
|
| |