Gene Dshi_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2973 
Symboldcp 
ID5710824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3139524 
End bp3141545 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content69% 
IMG OID641268899 
Productpeptidyl-dipeptidase 
Protein accessionYP_001534307 
Protein GI159045513 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC CGCTTCTCGA CACCTGGACC CCGCCCTACG GCCTGCCGCC CTTCGACCGG 
ATCGAGGATG CCCATTTCGC GCCGGGGCTG GAGGCGGCGC TGACCGAGGC GCGGGCCGAG
ATCGCGGCGA TCGCCGGGTC GGCCGAGGCC CCGACATTCG ACAACACGAT CCGCCCGCTG
GAGGCCGGGG CGCGCAAGCT GGGGCAGGTG GTGCGGGTGT TCTACCACGT CGCCGCCACG
GACAGCACGC CCGCGCGCGA GGCGCTGCAG AAGGACTTCA GCGCCAAGCT GAGCGCCTAT
AATTCCGAAG TGATCTCCAA CGCGGCGCTC TTTGCCCGGA TCGCGGCGGT CTGGGAGGGG
CGCGAGGCCC TCGGGGCCGA AGAGGCGCGG GTTGCGGAGC TGTATTACAA AGACTTCACC
CGGGCGGGCG CGGCCCTCAC CGGGGCCGAC AAGGACCGGA TGACGGAGAT CAAGGCGCGG
CTGGCGATGC TGGGGACGGA GTTTACCCAG AACCTGCTGG CGGATGAGCG CGATTGGGTG
ATGCCGCTGG CCGACGCCGA TCTGGAGGGG CTGCCGGAGT TCGTCGTCGC CACGGCGCGC
GCGGCGGCCG AGGAGCGCGG GATGGAGGGG CATGTCGTGA CCCTGTCGCG GTCTCTGATC
GTGCCGTTTT TGCAATTCAG CCCGCGCCGG GATTTGCGCG AGAAGGCCTA TGAGGCCTGG
GTCGCGCGGG GCGAGCATGA TGGCCCCACG GACAATCGCG GCATCGCGGC AGAGGTTCTG
GCCCTGCGGG AGGAGCGCGC CAAACTGCTC GGCTACGACA GCTTCGCGGA CTACAAGCTG
GAGCCGGAGA TGGCCAAGAC CCCGGCGGCG GTGCGCGATC TGCTGATGGC GGTCTGGGCC
CCGGCCAAGG CGGCGGCGGA GGCCGATGCC GAGGTGCTGA CCGCGATGAT GCAGGAGGAT
GGGGTCAACG GGCCGTTGGA GGCCTGGGAC TGGCGCTATT ACTCCGAAAA GCGGCGGCAG
GCGGAGCATG ACCTGGATGA GGCGGAGCTG AAGCCGTACC TGCAACTCGA CAAGATGATC
GAGGCGGCGT TCGATTGTGC CGCGCGCCTG TTCGGGCTGT CCTTTGCCCC CATCGATGCA
CCGCTGGCCC ACCCGGACGC ACGCGCCTGG GAGATCCGGC GCGGCGAGCG GCTGATGGCG
GTGTTCGTGG GGGATTACTT CGCACGGGCG GGCAAGCGGT CGGGGGCCTG GTGCGGGTCG
CTGCGCGCGC AGCACAAGCT CGACGGGGAT ACCCGCGCGA TTGTCACCAA TGTGTGCAAC
TTCGCCAAGC CGGCCAAGGG GCAGCCCGCG CTGCTGTCGT TCGACGATGC GCGGACGCTG
TTTCATGAGT TCGGCCATGC GCTGCATCAT ATCCTGTCGG ACGTGACCTA TCCGATGATC
TCGGGCACCT CGGTGGCGCG GGACTTCGTC GAACTGCCGA GCCAGCTTTA CGAGCATTGG
CTGGAGGTGC CCGAGGTGCT GCGCGCCTTC GCGGTCCATG CGGAGACCGG CGCGCCGATG
CCCGCCGACC TGCTGGCGCG GATGCTGGCC GCGGCAACCT ATGACATGGG GTTCCAGACG
GTGGAATATG TGGCCTCGGC CCTGGTCGAC CTGGATTTCC ACGAGGGCGC GGCCCCCGCG
GACCCAATGG CGCGGCAGGC GGAGGTGCTG GCCAAGCTGG GCATGCCCCA CGCGATCCGG
ATGCGCCACG CAACGCCCCA TTTCGCCCAT GTGTTCGCGG GCGACGGCTA TTCTTCGGGG
TATTACAGCT ACATGTGGTC CGAGGTGATG GATGCCGATG CCTTCGCGGC CTTCGAGGAG
GCCGGCAGCG CCTTCGACCC CGACACCGCC GCCAAGCTGG AGGCGCATAT CCTGTCGGCG
GGCGGATCGG CGGAGGCAGA CGGGCTATAC CGCGCGTTCC GCGGCCGGAT GCCGGGGGTC
GAGGCCCTGC TCAAGGGACG GGGGCTGGAC AAGGCCGCGT GA
 
Protein sequence
MTNPLLDTWT PPYGLPPFDR IEDAHFAPGL EAALTEARAE IAAIAGSAEA PTFDNTIRPL 
EAGARKLGQV VRVFYHVAAT DSTPAREALQ KDFSAKLSAY NSEVISNAAL FARIAAVWEG
REALGAEEAR VAELYYKDFT RAGAALTGAD KDRMTEIKAR LAMLGTEFTQ NLLADERDWV
MPLADADLEG LPEFVVATAR AAAEERGMEG HVVTLSRSLI VPFLQFSPRR DLREKAYEAW
VARGEHDGPT DNRGIAAEVL ALREERAKLL GYDSFADYKL EPEMAKTPAA VRDLLMAVWA
PAKAAAEADA EVLTAMMQED GVNGPLEAWD WRYYSEKRRQ AEHDLDEAEL KPYLQLDKMI
EAAFDCAARL FGLSFAPIDA PLAHPDARAW EIRRGERLMA VFVGDYFARA GKRSGAWCGS
LRAQHKLDGD TRAIVTNVCN FAKPAKGQPA LLSFDDARTL FHEFGHALHH ILSDVTYPMI
SGTSVARDFV ELPSQLYEHW LEVPEVLRAF AVHAETGAPM PADLLARMLA AATYDMGFQT
VEYVASALVD LDFHEGAAPA DPMARQAEVL AKLGMPHAIR MRHATPHFAH VFAGDGYSSG
YYSYMWSEVM DADAFAAFEE AGSAFDPDTA AKLEAHILSA GGSAEADGLY RAFRGRMPGV
EALLKGRGLD KAA