Gene Jann_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2048 
Symbol 
ID3934501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2052723 
End bp2053838 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID637904404 
Productpeptidase M24 
Protein accessionYP_509990 
Protein GI89054539 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.402173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.837706 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCT ACAAAACCCG CCTCGCGCGC TTGCGGACCC GCATGGCCGA GACCGGCACC 
GACCTTGTGG TCCTTGGCCC CACCAGCCAC ATGGCGTGGC TATCGGGCGC GGATCCCCAT
GGCGATGAAC GTCCCGTGAT GCTGCTGGTC AGCCAAAGCC ATGCCGGGTT TCTGATGCCC
GCGCTCAATG CCAATTCCGT GCGGCAGGCC ACCGATCTAC CGTTCGAACC CTGGACCGAT
GAAGCCGGTC CGACCGACGC CCTGGGCCGG TTACTGGAGG TCTGCAACAT CTCTGGCCCC
GGCAAGACCG TGGCGCTGGA CGAGGCGATG CGCGCCGATT TCGCGCTGCT TCTGCTGGAC
GCGATGGAGG CCCCTGTGCG GCGGTTCTCC GGCGATACTC TGGGCCATTT GCGCGCGATG
AAAGACACCG CTGAGGTCGA GGCTTTGCGC ACCTGCGCGC ATCTCAACGA TGCGGCCGCC
TCCGCCGGGT TTGCGTCACT GCGCGCTGGC ATGACCGAGC GGGACGTGGC CACGATTATC
CGCGACCATT ACGTGGCCCA TGGCGCGAAG CCGGAATTCA CTATCGTGGC CTTCGGCGCA
AACGGCGCGT TCCCCCATCA CCATACCGGC GACACGGTTC TGCACGACGA TATGGCCGTG
CTGATTGATA CAGGCTGCCG GATCGGCGGC TATCCCAGCG ATATGACCCG GTGCGGTTGG
TTCGGCTCCG CACCCTCAGC CGAGTTCCTT CGTGTGGCGG ATGTGGTCGA GCGGGCGGTG
CAGGCCGCCA TCGCGGTCGT GTGTCCCGGT GTCCTTGCCC GAGAGATAGA CGCGGCGGCA
CGGGGCGTGA TTGAGGATGC GGGTTATGGC GACTTCTTCG TGCACCGCAC CGGTCATGGC
CTTGGGCTGG ATATCCATGA GCCACCATAC ATCACGGCCA CATCCGACAC CCTGATGCAG
GCGGGCCATG TCTTCTCCAT CGAGCCGGGG ATTTACCTGC CGGGACAGTT TGGCCTGCGG
CTGGAGGACA TCGTCATCGC GACCGACACC GGCGCGGATG TCCTGTCGGC CCTTCCGCGC
ACGATCGTGA CATCTGTGGA TGGCCCGGCC AGCTAA
 
Protein sequence
MDIYKTRLAR LRTRMAETGT DLVVLGPTSH MAWLSGADPH GDERPVMLLV SQSHAGFLMP 
ALNANSVRQA TDLPFEPWTD EAGPTDALGR LLEVCNISGP GKTVALDEAM RADFALLLLD
AMEAPVRRFS GDTLGHLRAM KDTAEVEALR TCAHLNDAAA SAGFASLRAG MTERDVATII
RDHYVAHGAK PEFTIVAFGA NGAFPHHHTG DTVLHDDMAV LIDTGCRIGG YPSDMTRCGW
FGSAPSAEFL RVADVVERAV QAAIAVVCPG VLAREIDAAA RGVIEDAGYG DFFVHRTGHG
LGLDIHEPPY ITATSDTLMQ AGHVFSIEPG IYLPGQFGLR LEDIVIATDT GADVLSALPR
TIVTSVDGPA S