Gene Bind_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3822 
Symbol 
ID6197996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010580 
Strand
Start bp130105 
End bp131553 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content58% 
IMG OID641703950 
Producthemerythrin HHE cation binding domain-containing protein 
Protein accessionYP_001831102 
Protein GI182676955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.690759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGG AAAAGGAAGA AAAGGAAAGC GCCGATCCCA AAGGTCAAAC CGGAGCCAAA 
AGCTCATCGG ACGTTACCGG AGAAGACGCC AAAAAACAGG ACGCTATATC GATGTTGAAG
GCGGACCATC GGGAGATAGA GCAATTGTTC GACCGATACA AAGCTGTCTC TCGTCGGGCT
GACCGCGCGA AGATCGCGAA GGAAGTCTGC AATGCGCTGA CGATCCATGC GATACTGGAG
GAGGAAATTT TTTATCCTGC TTGCCGTGAG CTTATCGACG ACCAACCGCT CGACGAAGCA
CAAGTCGAAC ACGACAGTGC GAAGGTGCTC ATCGAAGAGC TGATCTCCGG AAGACCCGAA
GACCCTTTTT ACGATGCAAA GGTCAATGTA TTAGCCGAAC AGGTTAAGCA GCACATTCGA
GAAGAAGAAG GGAAGCCCGA CAGCATTTTC GCAAAGGCCG AGGCTGCCGG AGCCGACTTC
GTCGTCATTG GCGAAGAGCT TAAGAAGCGC AAAGCGGAAC TTTTGAGCAA GGCTAGCCGA
GAGACGCTGC AAGCGGAACC GTGCTCATTC AAAGCCCTTG CAAATCTCAC CAAGGAAGAA
GACATGGCAT CTTATCAACG CGGCCGAGAC TATGAGGAGC GCGGCCCGTC GCGCTCGCGT
TATGATGAAG AGACAGGTCG GCGGCGGGAC GAGCAGGGCC GCTTCATGAG CAACGAGGAT
TATGGCAATC GTGGTGGACA AGAACGCAGG TATCGCGTCG GGTCCGAGCG CGATGAATAC
AGCCGTTCCT CGAGCGATCG CGACCAAGGT TATCGTTCGC GTCCATCCTA TGAAGACGAG
GAATATTCCT CACGCGGCCA GCGTATGCCC GAGCGCGACG AATATGGTCG ATTTGTGAGC
GATGATGAGC GTCACGGCCA GAGTTATTCC CACGGACGCG ATTACGAGAA TGAACGTCGC
GGGTCTCGCA GCCATGGTGG CTGGTTTGGG GATCCCGAAG GCCATGCGGA AGCCGCCCGC
AGAGGTTGGG ACGAGCGCGA GGGCGAAGGC CGCGCTTATC GCGATGAGAA CGAACGCAGA
AGCTCGTCGC AGGGAAGCGG ACGGCGATAC GAGCAGCGTT CACGGCAGGA GGAGGATGAC
GAGCGACGCC AGGGCAGCGG ACGCGATTAC GAGGATGAAC GTCGCGGCTC TCGTAGCCAC
GGTGGCTGGT TTGGGGATCC CGAAGGCCAT GCGGAAGCCG CCCGCAGAGG TTGGGACGAG
CGCGAGGGCG AAGGCCGCGC TTATCGCGAT GAGAATGAAC GCAGAAGCTC GTCGCAGGGA
GGCGGACGGC GATACGAGCA GCGTTCACGG CAGGAGGAGG ATGACGAGCG GCACCAGGGC
AGCGGCTGGT ACGGAGATCG GGAAGGTCAT TCAGAGGCGT CCCGTCGAGG CTGGGAACAT
CGGCGATAA
 
Protein sequence
MKQEKEEKES ADPKGQTGAK SSSDVTGEDA KKQDAISMLK ADHREIEQLF DRYKAVSRRA 
DRAKIAKEVC NALTIHAILE EEIFYPACRE LIDDQPLDEA QVEHDSAKVL IEELISGRPE
DPFYDAKVNV LAEQVKQHIR EEEGKPDSIF AKAEAAGADF VVIGEELKKR KAELLSKASR
ETLQAEPCSF KALANLTKEE DMASYQRGRD YEERGPSRSR YDEETGRRRD EQGRFMSNED
YGNRGGQERR YRVGSERDEY SRSSSDRDQG YRSRPSYEDE EYSSRGQRMP ERDEYGRFVS
DDERHGQSYS HGRDYENERR GSRSHGGWFG DPEGHAEAAR RGWDEREGEG RAYRDENERR
SSSQGSGRRY EQRSRQEEDD ERRQGSGRDY EDERRGSRSH GGWFGDPEGH AEAARRGWDE
REGEGRAYRD ENERRSSSQG GGRRYEQRSR QEEDDERHQG SGWYGDREGH SEASRRGWEH
RR