Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_3822 |
Symbol | |
ID | 6197996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010580 |
Strand | + |
Start bp | 130105 |
End bp | 131553 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641703950 |
Product | hemerythrin HHE cation binding domain-containing protein |
Protein accession | YP_001831102 |
Protein GI | 182676955 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.690759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGG AAAAGGAAGA AAAGGAAAGC GCCGATCCCA AAGGTCAAAC CGGAGCCAAA AGCTCATCGG ACGTTACCGG AGAAGACGCC AAAAAACAGG ACGCTATATC GATGTTGAAG GCGGACCATC GGGAGATAGA GCAATTGTTC GACCGATACA AAGCTGTCTC TCGTCGGGCT GACCGCGCGA AGATCGCGAA GGAAGTCTGC AATGCGCTGA CGATCCATGC GATACTGGAG GAGGAAATTT TTTATCCTGC TTGCCGTGAG CTTATCGACG ACCAACCGCT CGACGAAGCA CAAGTCGAAC ACGACAGTGC GAAGGTGCTC ATCGAAGAGC TGATCTCCGG AAGACCCGAA GACCCTTTTT ACGATGCAAA GGTCAATGTA TTAGCCGAAC AGGTTAAGCA GCACATTCGA GAAGAAGAAG GGAAGCCCGA CAGCATTTTC GCAAAGGCCG AGGCTGCCGG AGCCGACTTC GTCGTCATTG GCGAAGAGCT TAAGAAGCGC AAAGCGGAAC TTTTGAGCAA GGCTAGCCGA GAGACGCTGC AAGCGGAACC GTGCTCATTC AAAGCCCTTG CAAATCTCAC CAAGGAAGAA GACATGGCAT CTTATCAACG CGGCCGAGAC TATGAGGAGC GCGGCCCGTC GCGCTCGCGT TATGATGAAG AGACAGGTCG GCGGCGGGAC GAGCAGGGCC GCTTCATGAG CAACGAGGAT TATGGCAATC GTGGTGGACA AGAACGCAGG TATCGCGTCG GGTCCGAGCG CGATGAATAC AGCCGTTCCT CGAGCGATCG CGACCAAGGT TATCGTTCGC GTCCATCCTA TGAAGACGAG GAATATTCCT CACGCGGCCA GCGTATGCCC GAGCGCGACG AATATGGTCG ATTTGTGAGC GATGATGAGC GTCACGGCCA GAGTTATTCC CACGGACGCG ATTACGAGAA TGAACGTCGC GGGTCTCGCA GCCATGGTGG CTGGTTTGGG GATCCCGAAG GCCATGCGGA AGCCGCCCGC AGAGGTTGGG ACGAGCGCGA GGGCGAAGGC CGCGCTTATC GCGATGAGAA CGAACGCAGA AGCTCGTCGC AGGGAAGCGG ACGGCGATAC GAGCAGCGTT CACGGCAGGA GGAGGATGAC GAGCGACGCC AGGGCAGCGG ACGCGATTAC GAGGATGAAC GTCGCGGCTC TCGTAGCCAC GGTGGCTGGT TTGGGGATCC CGAAGGCCAT GCGGAAGCCG CCCGCAGAGG TTGGGACGAG CGCGAGGGCG AAGGCCGCGC TTATCGCGAT GAGAATGAAC GCAGAAGCTC GTCGCAGGGA GGCGGACGGC GATACGAGCA GCGTTCACGG CAGGAGGAGG ATGACGAGCG GCACCAGGGC AGCGGCTGGT ACGGAGATCG GGAAGGTCAT TCAGAGGCGT CCCGTCGAGG CTGGGAACAT CGGCGATAA
|
Protein sequence | MKQEKEEKES ADPKGQTGAK SSSDVTGEDA KKQDAISMLK ADHREIEQLF DRYKAVSRRA DRAKIAKEVC NALTIHAILE EEIFYPACRE LIDDQPLDEA QVEHDSAKVL IEELISGRPE DPFYDAKVNV LAEQVKQHIR EEEGKPDSIF AKAEAAGADF VVIGEELKKR KAELLSKASR ETLQAEPCSF KALANLTKEE DMASYQRGRD YEERGPSRSR YDEETGRRRD EQGRFMSNED YGNRGGQERR YRVGSERDEY SRSSSDRDQG YRSRPSYEDE EYSSRGQRMP ERDEYGRFVS DDERHGQSYS HGRDYENERR GSRSHGGWFG DPEGHAEAAR RGWDEREGEG RAYRDENERR SSSQGSGRRY EQRSRQEEDD ERRQGSGRDY EDERRGSRSH GGWFGDPEGH AEAARRGWDE REGEGRAYRD ENERRSSSQG GGRRYEQRSR QEEDDERHQG SGWYGDREGH SEASRRGWEH RR
|
| |