Gene Information |
Locus tag | Haur_3432 |
Symbol | |
ID | 5735293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4321300 |
End bp | 4322685 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280579 |
Product | hypothetical protein |
Protein accession | YP_001546196 |
Protein GI | 159899949 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
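The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_3432.
length = gene_length(4321300, 4322685)
print(length)                  # 1386
print(protein_length(length))  # 461
```

Both results agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields listed in the record.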
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAATC CACCACAAGG GCTGGAAGAA ACCACCCAAG GTGGCCCGCA GCGGCCTGTC GTGCCGCCAC AGCGGGTTCA AATTCGCTGT CAAAATTGCC AAACGCCGTT TGTGGTGGCA CTTTGGACAT ACGTCGATGT TGGGGTGCAG CCTGAATTAA AAGGTGCGAT GCTTTCAGGT CAATTAAATG TCGCAGTTTG CCCTAGCTGT GGCGCTGGCG GCATGCTGGC CTCACCCTTG ATCTACCATG ATCCTGAAAA GAAGTTTTTT TGGACGCTCT TTCCGCAAGA ATTGAATTTG CCAACCGAGG AGCAAGAACG CTTCGTTGGC GAGGCTTCGA AAGTGGCTAT GCAATTGTTG CCACCCGATG CGCCACGTGG CTATTTGCTG ACTCCCCGCC GTTTCATCAC CATGCAAACC ATGCTCGAAA CCTTGTTGGA GAGCGAAGGC ATTACTAAAG AGGTGATGCA AGCGCAACGC CAACGGGTCG AGTTGCTAAG CCAATTGGCC GAAGCACTCG AGGCTGATCG AGCCGAGAAT TTGCTCGATT CAGATCAAGG TAAATTGGCC CAAGTGGTGG CTGCTAACAA GGCGGCACTG GATGGCGAAT TTTTCCTCAC CCTCAATTCC TATATCGAAG CTGCCTTACA ACAAGGTCGC GAAGATAGTG CCGAAATGAT CGGCGAGTTG AGCCAACGGG TGATGCAGTT GAGTGGCTTT GATCCGGTAG CCGCTGGGTT ACAAGAGCCA GCCGTCGAAG ATGTGGTTGC CGCTTTGCGC GATGCCGATG ATGCCAGCCT CGAAGATGTG ATTAGCAATT ATCGCCCGTT GATCGATGAT GAAACCTTTG ATGCTTGGGA TGCCCAAATT GCTGCCTTGC CCGAAGCAGA CCAAGCTGAG GCTCAAGCGC GACGCGATCA TATCTACACC ACGCTGGAGC GTATGGATGC TGAAGCCCAA GCCATTTTTG AAAAAGCCAA TGGCCTGTTG CGTGATGCCT TGCAAGCCGA AGATCCGCGG GCGTTGTTAG TTGAGCGCCA TAAAGAGTTA AGCGAAGCCT TTTTCGTGGT GATCGATGCC AACTTGAATG CAGCGATGCG TACCAATCAG CAAATTATTG CCGAGCAATT GATGTTGTTG CGCCAAACTG CTGCCGAAGT GTTGCAAGAG GCCATGACTC CTCAAGAACG CTTGATCAAT CAATTATTGA GCGCTGAAAC TGCTGGTGAT GCTACCAAGT TGCTGCGCAA AAACATGGCT TTGGTCAATG GCGATTTCGT CAAGGAAGTC AACGAATTGG CCGAGCAAAT GGAAAAAGCT GGGCGTAAGG AAGTCGTCGA GCGCCTCCGC CAAGTCGCCC GCGAATCGGC GAGCCTCCTG TTTTAA
|
Protein sequence | MTNPPQGLEE TTQGGPQRPV VPPQRVQIRC QNCQTPFVVA LWTYVDVGVQ PELKGAMLSG QLNVAVCPSC GAGGMLASPL IYHDPEKKFF WTLFPQELNL PTEEQERFVG EASKVAMQLL PPDAPRGYLL TPRRFITMQT MLETLLESEG ITKEVMQAQR QRVELLSQLA EALEADRAEN LLDSDQGKLA QVVAANKAAL DGEFFLTLNS YIEAALQQGR EDSAEMIGEL SQRVMQLSGF DPVAAGLQEP AVEDVVAALR DADDASLEDV ISNYRPLIDD ETFDAWDAQI AALPEADQAE AQARRDHIYT TLERMDAEAQ AIFEKANGLL RDALQAEDPR ALLVERHKEL SEAFFVVIDA NLNAAMRTNQ QIIAEQLMLL RQTAAEVLQE AMTPQERLIN QLLSAETAGD ATKLLRKNMA LVNGDFVKEV NELAEQMEKA GRKEVVERLR QVARESASLL F
|
| |
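The gene and protein sequences above can be spot-checked against each other by translating a few codons. The sketch below assumes that the gene sequence is shown in coding orientation (it starts with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; the two tables differ in which start codons they permit. The `translate` helper is hypothetical and written only for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGACCAATC CACCACAAGG GCTGGAAGAA"))  # MTNPPQGLEE
# Last six bases of the gene sequence: final residue plus the stop codon.
print(translate("TTTTAA"))                            # F*
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal F followed by a stop, consistent with the record.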