Gene Bind_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3521 
Symbol 
ID6199359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3999678 
End bp4000841 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content61% 
IMG OID641707476 
ProductSel1 domain-containing protein 
Protein accessionYP_001834567 
Protein GI182680421 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.894704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.280761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGG TCAAAAGCGG CTGCGAGCAT TCTTGCAAGA GTCATTGGAT GAGAAACAGG 
GTTTTGCCGG TGAGCCTGGC CTTCAGTCTT GCCTTGCTCA GTGCTGGAAT GGATGCGCGG
GCACAAGGAC AAAAGGCTGG CAAGAGTGTC CCTCTCGCGC CGTCTGAGGC TCCCCTGATC
TCTATGCCCT TGAGCAATCC CTTGAGCGCA CCCATGGGCG CTCCTTGGGG CTCTTCCATG
TTGATGCCTA AAGCGACCTC GTCACGGCAA GAGGGCGTAA CGCTGGCCTC GCCCATCGAT
ACGCCCGACA AGCCGGGCGA GGGGCGCCCT GACGGTGATC TCGCCTTTGG AGCATTTCAA
CGCGGCCATT ACACGACGGC CCTGCGCGAG GCCATGAAGC GCATCGATGC AAATCCCAAT
GATGGGGCCG CCATGACTCT CATGGGCGAG CTTTATAGCC AGGGGCTGGG CGTGAGACGC
GACGCGACCG AGGCGGCGCG GTGGTACAAG CTCGCCGCTG ACCGGGGTGA TCGGCAGGGC
ATATTCGCGC TCGCCAGCGC CAAAATGCGT GGGGACGGCG TGCCGGAAGA CCGGCCGGGC
GCCAAAATCC TCTTCACCCA GGCGGCCGAG AAAGATCATG CCGGCGCTCT CTATAATCTC
GGCATTATGG CGATCGAGCA TAATGGCGTC GCCTCGGATT TCGTGACGGC GGCGCGTGAT
TTCGAAAAAT CCGCCAAGCT CGGCGATGCG GCATCGGCCT ATGCCTTGGG GCTCCTCTAT
CGCAACGGCA ATGGCGTCGA AAAGGACGAG GCCCGCGCCG CGTTCTGGAT CGGCCAGGCG
GCGGATAATG GCAATATCGA GGGCCAGATC GAATATGCGA TCATGCTGTT CAATGGCATT
GGTGTCGAAA AGAATGAGGC GGCGGCCGCC AAATATTTTC TGAAGGCTGC GGTGCAGAAC
AATCCCGTCG CGCAAAATCG CCTGGCGCGA CTGCTGATTG CCGGCCGGGG CGTGGCGCCC
AATCCCGTCG AAGCGATGAA ATGGCATTTG CTGGCACGCA CCGCGGGCCT CAAGGACGCA
TGGCTTGACG CGGAATTGAA CAAATTATCG CCCGATCAGC GCAAGGCGGT TGAGGCGGCG
CTGCGCCAAT ATGTCAGCAA TTGA
 
Protein sequence
MNRVKSGCEH SCKSHWMRNR VLPVSLAFSL ALLSAGMDAR AQGQKAGKSV PLAPSEAPLI 
SMPLSNPLSA PMGAPWGSSM LMPKATSSRQ EGVTLASPID TPDKPGEGRP DGDLAFGAFQ
RGHYTTALRE AMKRIDANPN DGAAMTLMGE LYSQGLGVRR DATEAARWYK LAADRGDRQG
IFALASAKMR GDGVPEDRPG AKILFTQAAE KDHAGALYNL GIMAIEHNGV ASDFVTAARD
FEKSAKLGDA ASAYALGLLY RNGNGVEKDE ARAAFWIGQA ADNGNIEGQI EYAIMLFNGI
GVEKNEAAAA KYFLKAAVQN NPVAQNRLAR LLIAGRGVAP NPVEAMKWHL LARTAGLKDA
WLDAELNKLS PDQRKAVEAA LRQYVSN