Gene Sde_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1643 
Symbol 
ID3965159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2107762 
End bp2109066 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content48% 
IMG OID637920724 
ProductXaa-Pro dipeptidase 
Protein accessionYP_527115 
Protein GI90021288 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAC GTGATGATAT GACCTTCCCC GCTGATGAGT ACGAACGCAG AATTAATGAA 
TTACGGGCTC GCATAGCAGA GCGTCACTTA GATGCCGTAG TCATTTCCGA CCCCGAAAAT
ATCATGTACT TAACAGACTA TCAAACAACG AGTTACTCTT TTTTTCAGGC CTTAGTTGTC
CCGTTAGAAA AAGAACCGTT TATGATTACC CGCGCGCTAG AAGAGTCCAA CGTAATTGCA
CGTACTTGGG TGGATTTAAC CCGACCCTAC CCAGACACTG GCGACGCCAT GCAAATGCTT
GTGGATGCAT TAAAGGAGTT TGGCTTAGCC AACAAACGCA TTGGCTACGA ACGCAACAGC
AACTTCTTCC CCGCTTATCA TCAAGATGTA ATTCACACCA CGCTCAAAGA CGGTAAATTA
TTAGACTGTT TTGGCATTGT TGAAGAAGGC CGCATTTGTA AATCTGCTGT CGAAATTGCG
TTAATGAAAA AAGCCGCCTT GGCAACAGAA GCCGGTATGG CTGCAGGCAT TGCCGCTTGC
CGCGCTGGCG CGACAGAAAA TGAAGTTGGT GCAGCGATCA GTGCCGGCAT GTTTAACGCG
GGCGGAGAAA CGCCTGCTGT TATGCCCTAC GTAACCTCAG GGCCACGAAC GATGATTGGT
CACGCCACGT GGGAAGGCCG AGTAATACAA GATAACGAGC ACGTATTTTT AGAAGTGGGA
GGCTGCTACC GGCGCTACCA TACCGCCATG ATGCGCACCG TTATTCTTGG CGACCTTACC
GACTCTATGC ACATAGCCCA AGAAACAATG AAACGCGCGC TTGATGCTGT GCATTCCATG
GTGCAACCGG GTATGACAGT GTCCGACGTG GACAACCTCG TAAGAAATAT TATTAGCGAC
AACCCAGTCG GTGCACGACT TATTACACGC TCCGGTTACT CCATTGGTAT CGCCTTCCCT
CCAAGCTGGG ATGAAGGCTA TATAGTAAGT TTAAAACAAG GAGAATCCAC AGTACTTAAA
GAAGGCATGA CCTTTCATTT AATTCCTTGG ATGTGGGGCG TGGACGGTGA CAAAACTTGC
GGCATTTCCG ATACGATCCA TATAACCGAC GAAGGCTGCG AATCGTTCTT TAGCATGGAG
CGAGACTTTA CCGTTAAACC CAGCGAAAAC GGTATAGCAG CAATAGCAGC ACCCCAAACA
GCGAGTTGCG TAGATTTATC CACAGCGCGA AACAGCACAA CACCTAGCAA AAAAACCGCA
AAAAAAATAA CGGCTAAGGA GGTTGACAAT GCTAGCGCCA GTTAA
 
Protein sequence
MIKRDDMTFP ADEYERRINE LRARIAERHL DAVVISDPEN IMYLTDYQTT SYSFFQALVV 
PLEKEPFMIT RALEESNVIA RTWVDLTRPY PDTGDAMQML VDALKEFGLA NKRIGYERNS
NFFPAYHQDV IHTTLKDGKL LDCFGIVEEG RICKSAVEIA LMKKAALATE AGMAAGIAAC
RAGATENEVG AAISAGMFNA GGETPAVMPY VTSGPRTMIG HATWEGRVIQ DNEHVFLEVG
GCYRRYHTAM MRTVILGDLT DSMHIAQETM KRALDAVHSM VQPGMTVSDV DNLVRNIISD
NPVGARLITR SGYSIGIAFP PSWDEGYIVS LKQGESTVLK EGMTFHLIPW MWGVDGDKTC
GISDTIHITD EGCESFFSME RDFTVKPSEN GIAAIAAPQT ASCVDLSTAR NSTTPSKKTA
KKITAKEVDN ASAS