Gene Bind_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1930 
Symbol 
ID6200939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2200981 
End bp2202525 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content61% 
IMG OID641705919 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001833043 
Protein GI182678897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCTG TTCGAGGTTT TATTCTTCTC GCGACAATTT CGCTTCTCTT CACCCTCCCC 
TTGACCTTTC CTTCCTCGGC TAGGGCTGAA CAAACGGTCC CAATCCCGCG CGCGGAAACC
GAACAGACCA AGGATACCAC GCCGCCGCAT CCCCTGCCCG CTCCGGTTAC GACGACCCAT
ACGCTCGATC TGCCCGGACG CAGCCTGAAT TTTCAGGCCA TAGCCGGAGC GATCAAACTT
TCCGACGCGC AAAGCGGCAC ACCAGAAGCC GACATTGGTT TTACCGCCTT CCTCCTCAAC
GGCCAGGAAG CGTCACAGCG CCCCATCGTG CTGGTCTTTA ACGGCGGGCC GGGAGCTTCT
TCCGGCTGGC TTAATCTCGG CGCGCTCGGA CCGTGGCGGC TCAAAGCCGA CGCTCCTCTT
CTTGCGCCCT CACAACCGCC CATGCTCGTG CCCAATGCCG AGACCTGGCT CGATTTCGCC
GATCTCGTCT TTTTCGATCC GCCCGGCACC GGCTACAGCC GGCTTTACGG CAAGGATGAC
GAAGCCCGGC GGAGCTTTTT TTCCGTCAAT GGCGACATCA GCGCCTTGAG CGTCGCCATT
CGCAAATGGC TGGCCGAGCA TGATCGTCTC GCCAGTCCGA AATTCATCGT CGGTGAGAGT
TATGGTGGAT TTCGCGCGCC CAAACTCGCC CGTCGCCTGC AAGAAACAGA AGGCATCGGC
GTTTCGGGCC TCATCATGAT CTCGCCTGTC CTCGATTTCA GTTGGTTCGA GGGCGCCAAT
AATCCCCTCA TCGCGGTCGC GCGACTGCCA TCGCTCACCG CCACCGCGCG CGGACTCGAT
GGAGGCGCGA GCCGAGCCGA TCTCGCCGAT GTGGAAGCCT ATGCAAGCGG CCCCTATCTC
GTCGATCTCC TGCGCGGCGA ACGCGATCCC GCCGCGCTCG ACCGGCTGGC GGCCAAGGTT
TCCGCATTCA CCAAGCTCGA TCCCACTCTG GTGCGTCGGC TCGGCGGCCG TATCGATCTT
GCGACGCTCT CACGTGAGCG CAAGCGCGAT GAAGGCAAAG TCGCAAGCCT CTATGACGCA
CGCATTCTCG GCTATGATCC TGATCCCCAT CAGGCCTCGA GCGATTATGC CGATCCAATC
CTGGACGCTT TGCGCGCACC TCTCGCCAGC GCCATGGCGG ATCTCATCGC GCATCGCTTG
AACTGGCCGA TCGAGGCTCG CTATGAAATT CTCAACGACA ATGTCAATCG GCAATGGAAT
TGGAACCCGG ATCGGGGCCA TGCCCATGCC CAGGCGGAAT CCTTAAGCGA CCTCAAACAT
GTGATGGCGC TCGATCCGCG CCTGCGCGTG CTGGTCATCC ATGGATTGAG CGATATTGTA
ACGCCCTATT TCGCCTCGAA ACTTCTGCTC GACCAAGTCG CACCCATGGG CGATCCTGAT
CGCCTGCGTT TGTCGGTCTA TCCAGGCGGC CACATGCTCT ATCTCGAAGA GACGAGCCGA
GCGAAATTGC GCGAGGATGC GGCCAAGCTG ATCACCGGTC CCTGA
 
Protein sequence
MRPVRGFILL ATISLLFTLP LTFPSSARAE QTVPIPRAET EQTKDTTPPH PLPAPVTTTH 
TLDLPGRSLN FQAIAGAIKL SDAQSGTPEA DIGFTAFLLN GQEASQRPIV LVFNGGPGAS
SGWLNLGALG PWRLKADAPL LAPSQPPMLV PNAETWLDFA DLVFFDPPGT GYSRLYGKDD
EARRSFFSVN GDISALSVAI RKWLAEHDRL ASPKFIVGES YGGFRAPKLA RRLQETEGIG
VSGLIMISPV LDFSWFEGAN NPLIAVARLP SLTATARGLD GGASRADLAD VEAYASGPYL
VDLLRGERDP AALDRLAAKV SAFTKLDPTL VRRLGGRIDL ATLSRERKRD EGKVASLYDA
RILGYDPDPH QASSDYADPI LDALRAPLAS AMADLIAHRL NWPIEARYEI LNDNVNRQWN
WNPDRGHAHA QAESLSDLKH VMALDPRLRV LVIHGLSDIV TPYFASKLLL DQVAPMGDPD
RLRLSVYPGG HMLYLEETSR AKLREDAAKL ITGP