Gene Bind_3664 details

Gene Information

Locus tag: Bind_3664
Symbol:
ID: 6198644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 4155196
End bp: 4156245
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 57%
IMG OID: 641707615
Product: aldo/keto reductase
Protein accession: YP_001834705
Protein GI: 182680559
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
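
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; both are assumptions that happen to agree with the numbers in this record, not documented properties of the database. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4155196
end_bp = 4156245
gene_length_bp = 1050
protein_length_aa = 349


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Both checks hold for this record: the coordinate span is 1050 bp, and 1050 / 3 = 350 codons, of which one is the stop codon, leaving 349 residues.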


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.160957
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGAC GTCGATTGGG ACGAACGGAT CTGTTCGTTT CGGAAATTTG CCTTGGCACC 
ATGACTTGGG GTCAACAGAA TACCGAGGCC GAGGGCCACG CGCAGATGGA TTATGCGGTG
GAACAAGGCA TCAATTTCTT CGATACCGCG GAAATGTATT CGATCCCTCC CAAGGCCGAG
ACACAAGGCT CGACCGAGCG CATCATCGGG ACTTGGTTCA AGGCGCGCGG CAATCGCGAC
AAGATCATTC TCGCCTCAAA AGTCTCTGGA CGTGGCGAGG CCACCTGGCT GCGCCCGGAT
GGCTCGAAAA CCCGCATCGA CCGCAAAAAT ATCGAGGCAG CGATCGAGGG TTCGCTCAGG
CGGTTACAAA CCGATTATAT CGATGTCTAT CAATTGCATT GGCCTGATCG GCCCCTGGCT
TTATTCGCCG GCCAGACGAC GACCTTCAAG GACGTGCCGG AACCGCTCGA AAATCCGATC
GAGGAAACCG TCGAAATCCT GGGCGATCTC GTCAAGACCG GCAAGGTCCG TCATATCGCT
TTGTCCAACG AAACGGCCTG GGGCACGATG CGTTTCGTGC AAGCCTCCGA AGCGGGGCAT
GGACCGCGTG TCGTCTCGAT CCAGAACGCC TATAATCTTA TAAACCGGAC CTTCGAGATC
GGCCTGGCCG AAGTGGCCTT GCGCGAGAAT GTGGGTCTTT TGGCCTATTC CCCTTTGGCG
CAAGGTTATC TTACCGGCAA ATATCAGGGG GGCGCCCGCC CGCCTGGGGC GCGTACGACC
TTGTTTGATC GTGGCCAGCG GTATGAAAAG CCCGCCGCCT CCGAGGCAAT CGACGCCTAT
CTGGCCCTTG CCAAGGAGTT CGGCCTCGAT CCCGCGCAAA TGGCGCTCGC CTTCGTGACA
TCGCGACCCT TCGTCACATC CAATATTATC GGTGCGACGA CGATGGAGCA GTTGAAGGTC
GATATTGCCT CGATCCATGT GAAGATCGCG GCCGATCTCG AAAAGCGGAT CGACGCCCTC
CATCAAATTT ACAGCAACCC TTGCCCATAG
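
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first line of the listing, so its GC fraction is not expected to equal the 57% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing above, used as a small sample.
sample = "ATGGAACGAC GTCGATTGGG ACGAACGGAT CTGTTCGTTT CGGAAATTTG CCTTGGCACC"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the full listing into `sample` in place of the first line would give the length and GC fraction of the complete gene for comparison with the Gene Length and GC content fields.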
 
Protein sequence
MERRRLGRTD LFVSEICLGT MTWGQQNTEA EGHAQMDYAV EQGINFFDTA EMYSIPPKAE 
TQGSTERIIG TWFKARGNRD KIILASKVSG RGEATWLRPD GSKTRIDRKN IEAAIEGSLR
RLQTDYIDVY QLHWPDRPLA LFAGQTTTFK DVPEPLENPI EETVEILGDL VKTGKVRHIA
LSNETAWGTM RFVQASEAGH GPRVVSIQNA YNLINRTFEI GLAEVALREN VGLLAYSPLA
QGYLTGKYQG GARPPGARTT LFDRGQRYEK PAASEAIDAY LALAKEFGLD PAQMALAFVT
SRPFVTSNII GATTMEQLKV DIASIHVKIA ADLEKRIDAL HQIYSNPCP
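
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions: it builds a codon table in the conventional TCAG ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` helper is hypothetical and handles only unambiguous A, C, G, T bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the first line and the whole last line of the gene sequence above.
# Each full line holds 60 bases (20 codons), so every line starts in frame.
first_bases = "ATGGAACGAC GTCGATTGGG ACGAACGGAT"
last_bases = "CATCAAATTT ACAGCAACCC TTGCCCATAG"
print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to MERRRLGRTD and HQIYSNPCP, which match the beginning and end of the protein sequence listed above. A sequence containing ambiguous bases such as N would raise a `KeyError` in this sketch.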