Gene Bind_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0101 
Symbol 
ID6200918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp115208 
End bp116452 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content55% 
IMG OID641704098 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001831249 
Protein GI182677103 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.724967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC ATATAGCTGC GCAAGCGCCA TGGGATGTGA ATCTGATCCG TCAGGATTTT 
CCGATTCTGT CCCTGGAAGT CTACGGCAAG CCGCTGGTCT ATCTCGACAA TGCCGCCTCG
GCGCAAAAGC CGAAGGAGGT CGTGGACCGT ATGGTGCACG CAACCTACCA CGAATATGCC
AATGTGCACC GCGGTCTGCA TTATCTCGCC AATGCCGCGA CGGATGCCTT CGAAGTCGCC
CGCGAAAGCG TGCGCTCGTT CCTGAATGCG GAAAGTGTCA ACGAGATCAT TTTCACGAAA
TCGGCGACGG AAGCGATCAA TCTTGTCGCG TCCTCCTTCG GGCAAGCCTT CATCAATGAA
GGCGATGAAA TCGTGCTCTC GATCATGGAG CACCACGCCA ATATCGTGCC CTGGAATTTC
CTGCGCGAGC GCAAGGGTGC GGTGCTGAAA TGGGTTGATG TCGATGATGA CGGCAATTTC
CTGATCGAGG AATTTGAAAA GGCTCTGTCG CCGAAAACCA AGATCGTCGC CATGACGCAT
ATGTCGAATA TGCTCGGCAC GATCACGCCG GTGAAGGAAA TCATCAAAAT CGCTCATGAT
CGCGGCATTC CGGTGCTGAT CGATGGATCG CAAGGGGCCG TGCATCTCGA GGTCGATGTT
CGCGATCTCG ACGCGGATTT CTATGTCGTG ACCGGCCATA AGCTTTATGG GCCGACCGGC
ATTGGCGCTC TTTACGGCAA GAAGGAATGG CTTGAGAAAC TGCCGCCCTT CCTTGGCGGC
GGTGAAATGA TCAATGAAGT GACGCGTGAC CGCGTGACCT ATAACGAGCC TCCGCATCGT
TTCGAGGCCG GCACACCGCC GATCATCCAG GCCATCGGCC TCGGTGCCGC TGTTGATTAT
ATGCAAAAGC TCGGTCGCAA TCGCATTCAT GCGCATGAAA TGGCGCTGAG CGATTATGCC
CATGAACGGC TGTCCAAAAT CAATTCGCTG AAAATTTTCG GACGCGCCAA GGGCAAGGGA
GCGATCATTT CCTTTGAAAT GAAGAATGCC CATGCGCATG ATGTCGCGAC GATCATCGAT
CGTTCGGGCG TGGCCGTGCG GGCCGGCACG CATTGTGCTC AGCCGTTGCT TGCCCGCTTT
GGCGTGACTT CGACCTGTCG TGCTTCCTTC GCTGCTTACA ATACGTTCGA GGAAGTCGAC
AAACTTGCCG AGGCGTTGAT CCGAGCCGAA GGCCTTTTTG CTTGA
 
Protein sequence
MNKHIAAQAP WDVNLIRQDF PILSLEVYGK PLVYLDNAAS AQKPKEVVDR MVHATYHEYA 
NVHRGLHYLA NAATDAFEVA RESVRSFLNA ESVNEIIFTK SATEAINLVA SSFGQAFINE
GDEIVLSIME HHANIVPWNF LRERKGAVLK WVDVDDDGNF LIEEFEKALS PKTKIVAMTH
MSNMLGTITP VKEIIKIAHD RGIPVLIDGS QGAVHLEVDV RDLDADFYVV TGHKLYGPTG
IGALYGKKEW LEKLPPFLGG GEMINEVTRD RVTYNEPPHR FEAGTPPIIQ AIGLGAAVDY
MQKLGRNRIH AHEMALSDYA HERLSKINSL KIFGRAKGKG AIISFEMKNA HAHDVATIID
RSGVAVRAGT HCAQPLLARF GVTSTCRASF AAYNTFEEVD KLAEALIRAE GLFA