Gene Sama_2779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2779 
Symbol 
ID4605026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3321415 
End bp3322554 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content59% 
IMG OID639782190 
Productrenal dipeptidase family protein 
Protein accessionYP_928651 
Protein GI119775911 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATC TTCACCGGCG CAGCCTGATA AAAGCCCTCG GCGCCAGCGC CCTCTTATCT 
GCCCTGCCCA CCGGCGTGCT TGCCGCCAAG TCACTGCGAC CACTCTACAT AGATGGCCTG
TCGTTTTTAC CGGACTCTCT GGACGATCTC GCCGCCTCCG GTCTTTCGGC CTACCTGTGC
GATATCTCCG CCATCGAAGA AGTAAAACAG GAAGATGGCA CCCTCAACTA CAAGCGCACC
TACAACGCCT GCATCAAGTC GATTGCCGAC GCCGGAAAAC GCGTCAGCGA TAACCCGGGG
CAGCTGCTGC AGGGGCTTTC AGCCAAAGAC ATCAAAAACG CCCGTGAGTC GGGCCGCACC
GCGGTCTTTT TTCAAATTCA GGGGGCAGAC TGCGTAGAAG AACGCCTGTC ACAGGTGGAT
GAGTTCTACC AAAAGGGCCT CAGGGTAATG CAGCTCACCC ATCACTATGG CAACAGCTTT
GCCGGTGGCG CACTGGACAG CGATGAGCAC GGCGGCCTCA ATCTCCCCCT GAGCCCCAAG
GGCTATGCCC TGGTGGATAA GCTCAACGAC AGCGGCATTC TCATCGACCT GAGTCACTCC
AGCCCTCAAA CCGCGCTGGA CACCATAGCT GCCTCCCGCA TGCCGGTGGT GCAAAGCCAC
GGTGCGGCCC GTGCCATCGT CAACCATGCC CGCTGTTCAC CGGATCAGGT GATCCGCGCC
ATCGCAGACA GTGGCGGTGT ATTCGGGACC TTTATGATGA GCTTTTGGCT GACCACCAGC
AGCACTCCCA CGGTTGAGCA CTATCTGGCA CAGCTGAAGC ACGTGGCCAG GGTGGGAGGT
ATCGACGCGG TCGCCATTGC CAACGACTAT CCCCTGCGCG GCCAGGAAAA CCTGCTCAAA
CTCAACAATG ACAACGCCGA AGGGGTGAAG GAGTATCTGG ACTGGTGGCA CAGCCTGCGG
GCCAAAAAGG TACTCGGCTT CGACCATGAG CCGGTGCACG TGGTTATCCC CGAACTCAAT
CACATTGAGC GCATGAGCCG CATCCACGAT GCCCTCAAGG ATGCAGGCTT CAGCGCCGCT
GACGCCGATA AAATCATGGG CGGCAACTGG CAGCGGGTAT TGCAGCAGGT ACTGGTGTAA
 
Protein sequence
MTNLHRRSLI KALGASALLS ALPTGVLAAK SLRPLYIDGL SFLPDSLDDL AASGLSAYLC 
DISAIEEVKQ EDGTLNYKRT YNACIKSIAD AGKRVSDNPG QLLQGLSAKD IKNARESGRT
AVFFQIQGAD CVEERLSQVD EFYQKGLRVM QLTHHYGNSF AGGALDSDEH GGLNLPLSPK
GYALVDKLND SGILIDLSHS SPQTALDTIA ASRMPVVQSH GAARAIVNHA RCSPDQVIRA
IADSGGVFGT FMMSFWLTTS STPTVEHYLA QLKHVARVGG IDAVAIANDY PLRGQENLLK
LNNDNAEGVK EYLDWWHSLR AKKVLGFDHE PVHVVIPELN HIERMSRIHD ALKDAGFSAA
DADKIMGGNW QRVLQQVLV