Gene Meso_2201 details

Gene Information

Locus tag: Meso_2201
Symbol:
ID: 4181895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 2349822
End bp: 2350949
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 63%
IMG OID: 638068097
Product: peptidase M50
Protein accession: YP_674758
Protein GI: 110634550
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
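
The start, end, gene-length, and protein-length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2349822, 2350949)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.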


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTGGT CATTCAATAT CGGTAAGTTC AGCGGCACCG TTGTGCGCGT GCATGTGACG 
TTCCTGCTGC TGCTCGTCTG GATCTGGTTC ATGCATTACC GGATCGGCGG CGCGGCCGCC
GCATGGGAGG GCGTGGCCTT TATCATCGCG GTCTTCGCCT GTGTCGTGCT GCACGAATTC
GGCCATGCGT TCGCCGCCCG CCGATACGGC ATCAAGACGC CAGATATCAC ACTTCTTCCG
ATCGGCGGGC TCGCCCGACT CGAGCGCATG CCGGAAGAGC CCGGCCAGGA ATTTGTGATC
GCGGTGGCGG GCCCGCTGGT CAATGTGATT ATCGCGGCGA TTATTTTCCT AGCGGTGGGC
GGTTCCGCCG GAATAGAGCA GATGATGCAG GTGGAAAATC CGCGCACGAG CTTTCTCGTG
CGGCTTGCCG GCGTGAACGT GTTTCTCGTG CTCTTCAACA TGATCCCCGC CTTTCCCATG
GATGGCGGCA GGGTGTTGCG CGCCATTCTC GCCACGCGCA TGACCTGGGC GCGCGCCACG
CAGATCGCTG CCAATATCGG CCAGGGGCTC GCCTTCCTGT TCGGCTTTCT CGGTCTATTC
TACAATCCGC TGCTCATCTT CATCGCAATC TTCGTCTATC TCGCCGCCGC CGCCGAGGCG
CAGAACGCGC AGATCCGGGA CATCTCAAAC AGCGTGCTGA CGGGCGATGT GATGGTGACG
GAATTCGCCT CTCTCGAGCG CTCATCGACC ATCGCCGAGG CCATCGACCG CCTGCTGGCG
ACCACGCAGA GCGAATTCCC CGTGCTCGAC GCCGATGGTC ATCTGACCGG TCTGCTCACC
CGAAACGACA TGATCGCCGC CTTGAAGGAG ACAGGGCCCG ACGCGCCGGT GGTGAGCGTG
ATGCGCACGG ATGTGCCGAG CGTTCACCGT CTTCAGAGCC TTACTGACGG GTTCCGGCTG
ATGCAGGAGA AAAGCGCGCC CGCCGTCGCC GTGGTGGACA GCGGCGGCCG GCTTGTCGGC
TTGCTCACGC ACGAGACCAT TGGCGAAATG ATGATGGTGC GCGCGGCCAT GCCAGAGGGC
TTTCGCTTCG GCCGTTTGCG GCAGACCCCC TTCATTCGCC GTCCTTGA
 
Protein sequence
MAWSFNIGKF SGTVVRVHVT FLLLLVWIWF MHYRIGGAAA AWEGVAFIIA VFACVVLHEF 
GHAFAARRYG IKTPDITLLP IGGLARLERM PEEPGQEFVI AVAGPLVNVI IAAIIFLAVG
GSAGIEQMMQ VENPRTSFLV RLAGVNVFLV LFNMIPAFPM DGGRVLRAIL ATRMTWARAT
QIAANIGQGL AFLFGFLGLF YNPLLIFIAI FVYLAAAAEA QNAQIRDISN SVLTGDVMVT
EFASLERSST IAEAIDRLLA TTQSEFPVLD ADGHLTGLLT RNDMIAALKE TGPDAPVVSV
MRTDVPSVHR LQSLTDGFRL MQEKSAPAVA VVDSGGRLVG LLTHETIGEM MMVRAAMPEG
FRFGRLRQTP FIRRP
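
The protein sequence can be related to the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline. It assumes the codon-to-amino-acid assignment shared by NCBI translation tables 1 and 11 and ignores the alternative start codons that table 11 allows, which does not matter here because the gene begins with ATG. `translate` and the constant names are hypothetical helpers.

```python
from itertools import product

# Amino acids for the 64 codons, enumerated with bases in T, C, A, G order
# (third position varying fastest). This assignment is shared by NCBI
# translation tables 1 and 11; start-codon rules are deliberately ignored.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the spaces and newlines used for the 10-base display grouping.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCCTGGT CATTCAATAT CGGTAAGTTC"))  # MAWSFNIGKF
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` would be the way to check the remainder.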