Gene Bind_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0007 
Symbol 
ID6201674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp5533 
End bp6546 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID641704003 
Productporphobilinogen deaminase 
Protein accessionYP_001831155 
Protein GI182677009 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000143071 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.582224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGA ATAGCACGAT CGCTAACCCG CCGATAGATC AACCCCTTCC CCGTCTGCGG 
CTCGGCACGC GCGGCAGCCC GCTGGCGCTT GCCCAGGCGC ATGAGCTGGC AGATCGTCTT
GCCCGCGCGC ATGGCTTTGC GAAAGAAGCA GTTGCCATCA CGATCATCCG CACGAGCGGC
GACATGATTC AGGACCGGCC CCTTTCCTTG GCGGGAGGCA AGGGCCTGTT CACCAAGGAA
CTCGATCAGG CCTTGATCGA AGGCATGGTC GATCTCGCCG TCCATTCCGC CAAGGACCTG
CCGACCATCC TGCCGGAAGA CCTCATCATC GCCGGCTATT TGCCGCGCGA GGACGTGCGC
GATGTCTGGA TCTCCCCAAA GGCCGGCCAT CCGCGCGATT TGCCGCCGGG CTCTGTCGTC
GGTACGGCCT CGCTGCGGCG CGGTGCACTT TTGAAACGGC TGCGCCCCGA TCTCGAAGTC
AGATTATTGC GCGGCAATGT CGAGACGCGG CTCGCCAAAC TGGCCGCCGG GGAGGTCGAT
GCGACTTTAC TGGCACTGGC TGGCCTCCGT CGCCTCGGCC TTGCCGACAA GGCGACACAA
GTGCTGGCGA TCGAGGATTT TCTGCCCGCC GCCGGGCAGG GCGCGATCGG CATTACGACA
CGGCGGGATG ATGCGGCCAC CCTGGCGCTT CTCGCGCCGA TTCTCGATCC GGCGACTCAT
GTGGCGCTCG CCGCCGAGCG CGGCTTCCTC ACCGTGCTCG ATGGGTCCTG CAAAACGCCG
ATCGGTGCTC ATGCCACGGT CGAACACGAT CAAGTCACTT TGCGCGGCAT CGTCTTGCGG
CCGGATGGAT CGGAATGGTT CGAGGCCTGT GAAAGCGGTC CCCTTGAAAG CGGTTCCCTG
GAGGCGGCGC GGGAATTAGG CGAAACCGCA GCGCGCGCTA TTCTGGCGCG GTTGCCGGAA
GGATTCTTCC AAGAGAGCGC CCAAGAAAAT GCCCAAAAAA ACGCTAAGGA GTAG
 
Protein sequence
MSRNSTIANP PIDQPLPRLR LGTRGSPLAL AQAHELADRL ARAHGFAKEA VAITIIRTSG 
DMIQDRPLSL AGGKGLFTKE LDQALIEGMV DLAVHSAKDL PTILPEDLII AGYLPREDVR
DVWISPKAGH PRDLPPGSVV GTASLRRGAL LKRLRPDLEV RLLRGNVETR LAKLAAGEVD
ATLLALAGLR RLGLADKATQ VLAIEDFLPA AGQGAIGITT RRDDAATLAL LAPILDPATH
VALAAERGFL TVLDGSCKTP IGAHATVEHD QVTLRGIVLR PDGSEWFEAC ESGPLESGSL
EAARELGETA ARAILARLPE GFFQESAQEN AQKNAKE