Gene Dshi_2721 details

Gene Information

Locus tag: Dshi_2721
Symbol:
ID: 5713620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2882651
End bp: 2884507
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 69%
IMG OID: 641268646
Product: putative metallopeptidase
Protein accession: YP_001534055
Protein GI: 159045261
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
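
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2882651
end_bp = 2884507
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # span matches Gene Length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks agree with the listed values: the span is 1857 bp, which is 619 codons, or 618 amino acids plus one stop codon.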


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0993196
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTTT CTGCCCGCAA CGCTAGATTG TACCGGCAAG AAACAACCAA AGCACCCATG 
TTCCAATCTT TTTCCGCCAC CACCACCCCC GATCAGGGGC CACCCCGTCT TGCGGCCCTG
CGCGCCGAAA TGGCCGCGGA GGAGCTCGCG GGTTTTCTCG TGCCGCGCGC CGACGCGCAT
CAGGGAGAAT ACGTGGCCCC GCGCGACGAC AGGCTTGCGT GGCTCACGGG GTTCACCGGC
TCCGCGGGGT TCTGCATCGC CCTGGCCGGG ACCGCAGGGA TCTTCATCGA CGGGCGCTAC
ACCCTGCAGG TCCGCGCGCA GGTCGACAAC GGGGCATTCA CCCCGGTTCC CTGGCCCAAG
ACGCAACCGG GCCCCTGGCT GCGCGAAGCG CTTCCCACCG GGGTGATCGG GTTCGACCCC
TGGCTGCATA CCAATGCCGA GATCGCGCGG CTGGAGGCGA GCCTCGGCGA CGCGCTGTCC
TTGCGCCGCA CGGACAACCT GATCGACAGG ATCTGGCCGG ATCAGCCCGC GCCACCCCAA
GGCGCGGTCA TCGTCCATCC CGATAGCCTG GCCGGTCGCA GCAGCGCCGA GAAGCGCAGG
TCCCTCGCGC AGCACCTGAC CGAGTCCGGG GCAAAATCCG TGGTCCTCAC CCTGCCCGAC
AGCCTGTGCT GGCTGCTCAA CATCCGCGGC GCGGATATCC CACGCAACCC GGTGGTCCAT
GCCTTCGCGG TCCTACATGA CGATGCAAGC TGCGATCTCT TCATCGATCC GGCCAAGCTC
GATGACGATC TGCGCGCCCA TCTCGGGCCC GAGATCCGCT GCCACCCGCC GCACGACCTG
GCCGCAGCCC TCGGCGCGCT GGCCGGTCCG GTCCAGGTCG ACCCGAACAC CGCGCCTGTC
GCGATCTTCG ACCTGATGGC CGCCCAGGAC ACCCCGGTGA TCGAGGCCGA CGACCCCTGC
ATCCTGCCCA AGGCCTGCAA GACCGCGGCC GAGATCGCGG GCACCACCGA GGCGCACCTG
CGCGACGGGG CAGCGGTTGT CGAGTTCCTC ACCTGGTTCT CGGGTCAAAA CCCCGCGGAG
CTGACCGAAA TCGACGTGGT CATGGCGCTC GAAGCCGCCC GGCAGGCCAC GGGCGCCCTG
CGCGACATCA GTTTCGAGAC GATCTGCGGC ACTGGCCCGA ACGGCGCCAT CGTCCATTAC
CGCGTGACCG AAGGCACCAA CCGGCGGATC ACCCCCGGCG ATCTGCTGCT GATCGACAGC
GGTGGCCAGT ATGCGGACGG GACGACCGAC ATCACCCGCA CGCTGGCCAC AGGCACCCCG
CCGGAGGGCG CCAGAGCCGC CTTCACACGG GTCCTGCAGG GCATGATCGC CATCAGCCGC
GCGCGCTGGC CCAAGGGGTT GGCAGGTCGC GACCTGGACG CGCTGGCCCG CGCCCCGCTG
TGGATGGCCG GGCAAGATTA CGACCACGGC ACCGGGCACG GTGTGGGCAC CTATCTGTGC
GTCCACGAAG GCCCGCAGCG GCTCAGCCGG ATCAGCGAAG TGCCCCTCGA GTCGGGCATG
ATCCTCAGCA ACGAGCCCGG CTATTATCGC GAAGGCGCCT TCGGCATCCG GCTAGAGAAC
CTCGTCGTCG TCACGCAGGC CGACCCGCCC GAGGGCGGCG ATCCGCAACG CGAGATGTTG
CGCTTCGACA CCCTGACTTA CGTCCCGCTC GAGACCGCCC TCATCGACAC CGCGATGCTG
TCGCAGGCCG AGATCGACTG GATCGACACC TATCACGCGG AAACCCGCCA GCGCCTCCGG
GACCGGCTGA CGCCCGAGGC GCGTCGCTGG CTGGACAGGG CAACGCGCCC GCTGTGA
 
Protein sequence
MAVSARNARL YRQETTKAPM FQSFSATTTP DQGPPRLAAL RAEMAAEELA GFLVPRADAH 
QGEYVAPRDD RLAWLTGFTG SAGFCIALAG TAGIFIDGRY TLQVRAQVDN GAFTPVPWPK
TQPGPWLREA LPTGVIGFDP WLHTNAEIAR LEASLGDALS LRRTDNLIDR IWPDQPAPPQ
GAVIVHPDSL AGRSSAEKRR SLAQHLTESG AKSVVLTLPD SLCWLLNIRG ADIPRNPVVH
AFAVLHDDAS CDLFIDPAKL DDDLRAHLGP EIRCHPPHDL AAALGALAGP VQVDPNTAPV
AIFDLMAAQD TPVIEADDPC ILPKACKTAA EIAGTTEAHL RDGAAVVEFL TWFSGQNPAE
LTEIDVVMAL EAARQATGAL RDISFETICG TGPNGAIVHY RVTEGTNRRI TPGDLLLIDS
GGQYADGTTD ITRTLATGTP PEGARAAFTR VLQGMIAISR ARWPKGLAGR DLDALARAPL
WMAGQDYDHG TGHGVGTYLC VHEGPQRLSR ISEVPLESGM ILSNEPGYYR EGAFGIRLEN
LVVVTQADPP EGGDPQREML RFDTLTYVPL ETALIDTAML SQAEIDWIDT YHAETRQRLR
DRLTPEARRW LDRATRPL
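
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not taken from the record. How the record derived or rounded its listed GC content is not stated, so the function is only an approximation of that field.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGGCGGTTT CTGCCCGCAA CGCTAGATTG TACCGGCAAG AAACAACCAA AGCACCCATG"
last_line = "GACCGGCTGA CGCCCGAGGC GCGTCGCTGG CTGGACAGGG CAACGCGCCC GCTGTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))                           # bases in one full display line
print(first.startswith("ATG"))              # gene begins with an ATG start codon
print(last[-3:] in {"TAA", "TAG", "TGA"})   # gene ends with a stop codon
print(round(gc_fraction(first), 2))         # GC fraction of the first line only
```

Passing the whole gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare with the GC content field. The stop codon set used here (TAA, TAG, TGA) is the standard set, which translation table 11 also uses.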