Gene Bind_0063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0063 
Symbol 
ID6199142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp65496 
End bp67340 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content61% 
IMG OID641704060 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001831211 
Protein GI182677065 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC AGCACAAACG CTCAAGTCTC GTTCCCGAGA CTGTGACCTG CGGCCCTCTG 
CCGGGATCGC GCAAGGTCTA TCACCATCCT CAGAGCCATC CCGAACTCAG CGTGCCGTTC
CGCGAGATCG CGCTCGATCC GGCGTCTGGC GAGCCGCCGG TGCGCGTCTA TGATGCCTCT
GGTCCCTATT CGGAAGAGAA TTTCAAGCCC GATCTCGCCA AGGGCCTGCC GCGCACCCGC
GCCGTCTGGC TCGAAAAACG CGCGGGCCAT GAGTCCTATG CCGGACGTGC CGTCAAGCCG
GAAGACAATG GCTCCGTTTC GGCCGACCGC TTGGTGCCGC CCTGCCCGGC CAATGCCGCG
CCCTTGCGCG GGCGCACCGG TGCCTTGGTG ACGCAATATG AATTCGCCAA GGCTGGCATC
ATTACGGAAG AAATGATCTA CGCCGCCGCG CGCGAAAATC TCGGCCGCGA ACAGGCCGTG
GAGGGAGCCG CCGCCCGGCT CAAGGATGGC GAAAGCTTTG GCGCCTCGAT CCCCGAATTC
GTGACACCGG AATTCGTGCG CGACGAGATC GCGCGGGGCC GTGCGATCAT TCCCGCCAAT
ATCAATCATC CCGAACTCGA GCCGATGGTC ATCGGCCGCA ATTTTCTCGT GAAGGTCAAT
GCCAATATCG GCAATTCGGC TGTGACCTCT TCCGTTGCGG AAGAAGTCGA AAAGCTCGTC
TGGGCGATCC GCTGGGGCGC CGACACGGTC ATGGACCTTT CGACCGGCCG CAACATTCAC
AATATCCGCG ACTGGATCAT GCGCAATGCG CCGGTGCCGA TCGGCACCGT GCCGATCTAT
CAGGCGCTCG AAAAGGTCGA CGGCGATCCG ATCAAACTGA CCTGGGAAAT TTTCCGCGAC
ACTTTGATCG AACAAGCCGA GCAGGGCGTC GATTATTTCA CGATCCATGC AGGTGTCCGG
CTGGCCCATG TGCCGCTGAC CGCCAAGCGC ACCACGGGCA TCGTCTCGCG CGGTGGCTCG
ATCATGGCGC GCTGGTGCCT CGCGCATCAC AAGGAAAGCT TCCTCTACGA GCATTTCGAC
GAGATTTGCG ACATCATGCG CGCCTATGAT GTCTCGTTCT CGCTGGGCGA TGGCCTGCGG
CCCGGCTCGA TTGCCGATGC CAATGATGCG GCGCAATTCG CCGAACTCGA AACGCTGGGC
GAATTGACCA AGATCGCCTG GGCCAAAGGC TGTCAGGTCA TGATCGAAGG CCCCGGCCAT
GTGCCGATCC ATAAGATCAA GATCAATATG GAAAAGCAGC TCAAGGAATG CGATGAGGCG
CCCTTCTATA CGCTTGGGCC GCTAACGACC GATATTGCGC CTGGCTATGA TCACATCACC
TCCGGCATTG GTGCGGCGAT GATCGGCTGG TTCGGCTGCG CCATGCTCTG CTATGTGACG
CCGAAGGAGC ATTTGGGTCT GCCCGACCGC GACGATGTGA AAGTGGGCGT GATCACCTAT
CGCATCGCGG CCCATGCGGG CGATCTCGCC AAGGGCCATC CGGCGGCTCA GATCCGCGAT
GATGCCGTCT CGCGGGCGCG GTTCGATTTC CGCTGGCAGG ATCAGTTCAA TCTCGGCCTC
GATCCGGAAA CGGCGGCGCA TTTCCACGAT GAGACCCTGC CGAAGGACGC CCATAAGGTC
GCCCATTTCT GCTCCATGTG CGGACCGCAA TTCTGCTCGA TGAAGATCAC CCAGGATCTG
CGTGTGGAAG CCGCCGCCAT GGCGGAAGAG GAAGCCAAGC GCGAACAGGG CATGGCCGAA
AAGAGTGCCG AATTTTTGGA GAAGGGCGGC AAGCTTTACG TATAA
 
Protein sequence
MNIQHKRSSL VPETVTCGPL PGSRKVYHHP QSHPELSVPF REIALDPASG EPPVRVYDAS 
GPYSEENFKP DLAKGLPRTR AVWLEKRAGH ESYAGRAVKP EDNGSVSADR LVPPCPANAA
PLRGRTGALV TQYEFAKAGI ITEEMIYAAA RENLGREQAV EGAAARLKDG ESFGASIPEF
VTPEFVRDEI ARGRAIIPAN INHPELEPMV IGRNFLVKVN ANIGNSAVTS SVAEEVEKLV
WAIRWGADTV MDLSTGRNIH NIRDWIMRNA PVPIGTVPIY QALEKVDGDP IKLTWEIFRD
TLIEQAEQGV DYFTIHAGVR LAHVPLTAKR TTGIVSRGGS IMARWCLAHH KESFLYEHFD
EICDIMRAYD VSFSLGDGLR PGSIADANDA AQFAELETLG ELTKIAWAKG CQVMIEGPGH
VPIHKIKINM EKQLKECDEA PFYTLGPLTT DIAPGYDHIT SGIGAAMIGW FGCAMLCYVT
PKEHLGLPDR DDVKVGVITY RIAAHAGDLA KGHPAAQIRD DAVSRARFDF RWQDQFNLGL
DPETAAHFHD ETLPKDAHKV AHFCSMCGPQ FCSMKITQDL RVEAAAMAEE EAKREQGMAE
KSAEFLEKGG KLYV