Gene Jann_2343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2343 
Symbol 
ID3934799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2349835 
End bp2350995 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content64% 
IMG OID637904701 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_510285 
Protein GI89054834 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0732987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.851404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGTCA TCAATTCCAT CGCCGCCATG GCCCCAGAGA TGAAAACCTG GCGCAGACAT 
CTGCACGCGC ATCCTGAATT GAGCTTCGAC TGCCATGGGA CGGCGGCCTT CGTGGTGGAC
CGGCTGAAAG CGTTCGGCAT CACCGACATC CATGAGGGGA TCGCGACCAG CGGTCTTGTG
GCGATCATCG ACAGCGGGCG AGCGGGGCCC ACAATCGGCC TGCGGGCCGA TATGGATGCC
TTGCCGATCC TGGAGGCGAC GGGGGCGGAG CATGCCTCTA CGGTGCCGGG CAAAATGCAC
GCCTGCGGCC ATGATGGTCA TACGGCGATG CTTCTGGGGG CGGCGAAATA TCTGGTGGAG
ACGCGGAATT TCACTGGTCG TGTGGCGTTG ATCTTCCAGC CAGCCGAGGA AGACGGCGGC
GGCGGGGAGG TCATGGTGCA GGAGGGCGCG ATGGACCGGT TCGACATCAG CCGCGTCTTC
GCCATCCACA ACATCCCCGG CGCGCCGGAA GGGAGCTTCT TCACCACACC CGGCCCGATC
ATGGCCGCGG TCGACACCAT TACGGTTGAG ATTACCGGAC AGGGCGGGCA CGGGGCCTAT
CCGCAGGACA CCATTGACCC GATCCCGCCC GCCATGGCCA TTGCGCAAGG TTTTGGGACC
ATCGTGTCGC GCAACACCCG CTCCCTCGAC GATCTGGTGA TCTCGGTCAC GCAGATCCAC
GCAGGCGACG CCAGCAACGT GATCCCGTCC CATGCCATGA TCAATGGCAC CGTCCGCACG
TTTGATCCAG CAGTGCAGGA CATGGTGGCG CGCCGTATGG GCGAGATCGT CGATGGCACG
GCCGCGGCCT ACGGCGTCAC CGCCAAGCTG ACCTATGAGC GTGGCTACCC CGCGACCATC
AATGACCCAG ACCAGACGGC CTTTGCCGTC GGCGTCGCGC AGGAGGTGGT GGGCGAGGGC
GCGGTCATCG ACAATTCCAA CCGCGAGATG GGGGCGGAGG ATTTCTCCTA CATGCTGCAA
GCCCGCCCCG GCGCGTATTT GTTTCTGGGC GCGGGCGAGG GTGCGGGGCT GCATCACCCT
GGATTTGACT TCAACGACGA TATCGCACCA ATCGGGGCCA GTCTGCTGGC AAAAATCGTG
GAGACGGCCA ATCCCGCATA G
 
Protein sequence
MPVINSIAAM APEMKTWRRH LHAHPELSFD CHGTAAFVVD RLKAFGITDI HEGIATSGLV 
AIIDSGRAGP TIGLRADMDA LPILEATGAE HASTVPGKMH ACGHDGHTAM LLGAAKYLVE
TRNFTGRVAL IFQPAEEDGG GGEVMVQEGA MDRFDISRVF AIHNIPGAPE GSFFTTPGPI
MAAVDTITVE ITGQGGHGAY PQDTIDPIPP AMAIAQGFGT IVSRNTRSLD DLVISVTQIH
AGDASNVIPS HAMINGTVRT FDPAVQDMVA RRMGEIVDGT AAAYGVTAKL TYERGYPATI
NDPDQTAFAV GVAQEVVGEG AVIDNSNREM GAEDFSYMLQ ARPGAYLFLG AGEGAGLHHP
GFDFNDDIAP IGASLLAKIV ETANPA