Gene Bind_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2166 
Symbol 
ID6198706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2472223 
End bp2473236 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content62% 
IMG OID641706155 
Productglycine oxidase ThiO 
Protein accessionYP_001833275 
Protein GI182679129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.232699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.765835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTC GCATTATCGG CGCCGGAATC ATGGGATTGA CGACGGCTTT CGAATTTGCC 
TCCCACGGGG CGGATGTCGA AGTCGTCGAA CAGCGTGATG GCCCCGGCAA GGGCTGTTCT
TTCCTCGCTG GCGGCATGAT CGCGCCCTGG TGCGAGGTCG AAAGCGCCGA ACCCATTGTC
GGCACTATGG GGCTCGAGGC ACTGCGGTTC TGGACCGAAG ATGTACCGGT GGCGACGCGT
CAAGGAAGCC TAGTCCTTGC CCCGCCACGC GACCGGCCGG AACTTGCCCG TTTCTCTCGC
CTTACCAGCC ATTATGAACG GATGGACGGC GCGGCGCTCG CCGCGCTCGA GCCTGATCTC
GAAGGGCGTT TTGGCGAGGC GCTTTTCTTT CCCGAAGAAG CCCATCTTGA TCCACGCCAA
GCCACGGCGG CCTTGGGTGA GCGATTGGCG GCAGCGCCCA ATGTCATTTT GCGTTACGGC
ACCGAAGCCG AAGACCTGTC CGAAGCGGGT GCTGACTGGA TCATCGATTG CCGTGGCCTT
GCCGGGCGGG ATGCCTTGCC CGATTTGCGC GGGGTCAAGG GCGAAATGCT GGTTCTGCGG
ACGGGGGATA TCAGACTCGC AAGGCCCATC CGGCTTCTGC ATCCGCGTTT TCCGGTCTAT
ATCGTGCCGC GCGGCGACGG CCGTTTCATG ATCGGCGCCA CCTCGATCGA AAACGAGGAA
GAGGGGCGGA TCACGGCCCG TTCCATGGTT GAACTTCTGA GCGCGGCCAT GACCGTGCAT
CCGGCCTTTG GAGAGGCGGA AATCATCGAA ACCGGGGCCG GTTTGCGTCC GGCCTTTCCC
AATAATCTGC CACGCCTGCG GGTTGAGGGT CATGTCGTTC GGGCCAATGG TCTTTATCGG
CATGGTTTTC TTCTGGCGCC CCCCGTGGCG CGCAGGATCA GGCGCATGGT TCTTGAAGGC
GCTTCTTTTC CGGAGGTCAT GGATGCAGAT CCGCGTGAAC GGCAAAGAGC TTGA
 
Protein sequence
MRIRIIGAGI MGLTTAFEFA SHGADVEVVE QRDGPGKGCS FLAGGMIAPW CEVESAEPIV 
GTMGLEALRF WTEDVPVATR QGSLVLAPPR DRPELARFSR LTSHYERMDG AALAALEPDL
EGRFGEALFF PEEAHLDPRQ ATAALGERLA AAPNVILRYG TEAEDLSEAG ADWIIDCRGL
AGRDALPDLR GVKGEMLVLR TGDIRLARPI RLLHPRFPVY IVPRGDGRFM IGATSIENEE
EGRITARSMV ELLSAAMTVH PAFGEAEIIE TGAGLRPAFP NNLPRLRVEG HVVRANGLYR
HGFLLAPPVA RRIRRMVLEG ASFPEVMDAD PRERQRA