Gene Sama_1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1078 
Symbol 
ID4603330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1305235 
End bp1307031 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content55% 
IMG OID639780425 
ProductM1 family peptidase 
Protein accessionYP_926955 
Protein GI119774215 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000177882 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.812437 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTG TTATGCACCA GTGGGACAAC CGGCACGACT ATCACTCTTT CGCCAATACC 
GACAGCATCA GGGTGACTCA CCTGTCACTT GATCTCGCCA TCGATTTTGA TACCAGATGC
CTTCAGGGCT GTGTTCGCCT GGATTTTGTA AGAAAGGAGG GGGATGCGGC CGATGTGTTG
GTGCTGGATA CCCGGGCACT GGCCATTAAG TCCATTACCG ATGTGCATGG TCAGCCACTG
GACTGGGGAC TGGGACAGGC CAGTGAGATT CTTGGGCAGG CTCTGGAAAT TATGCTGCCC
AATGGCATCA CCAGTGTGCT GGTGCATTAT CACACCACAG AGGATGCCGA GGGCCTGCAG
TGGCTCGATG GCCCTCAAAC TCAAAGTGGC AAACCCTATT TGTTCTCCCA ATCGCAACCC
GTGAACGCCC GCAGTTGGAT CCCGCTGCAG GATACCCCCA AGGCCCGGGT GACCTTTGAT
GCCAGAGTCC GTGCCAACCA GCCCTGCAGG GTGGTGATGA GTGCGCTCAA TCAGGCGGAT
ATGCCCGCAG ACGGTGTATT TGAGTTTGTG ATGGATAAAC CCATGCCGAC CCATTTGCTT
GCCATTGCCG CCGGCCAGAT TGACCGGGTG CCTGTGAGTG AGCGCAGTGC CGTATTTGCC
GAGCCTGCCA TGGCTTCGCT GGCCGCCCGG GAGTTTGAAG ATATTGAAGC CATGATGCAG
ATGGCCGAGT CGATTCTTGG GCCCTATGCC TGGGAGCGAT ACGACATGTT GATTCTGCCG
CCCAGCTTCC CATTTGGGGG CATGGAAAAC CCTTGTCTGG CCTTTTTGAC CCCCACTCTT
ATCGCGGGCG ACAAGAGTCT GGTGTCCACC GTGGCCCATG AACTGGCCCA TTCCTGGACA
GGTAATCTGG TGAGCAATGC CACCTGGCGC GATCTCTGGC TCAATGAAGG CTTTACCACC
TACTTTACCA ATAGAATTGT GGAAGCCGTT TACGGTCGGG AGCAGGCTCA GCTTGAACTC
ATGCTGGAGT ACGGCAGGCT GAAGGAAGAA ATGGCGGGTA TGCCGCTGCC ACGGCAAACC
CTGCCAGCCA ATTTGCAGCA GGACGATCCC AACGCCGCAT TCAATCGCTT TACCTACGAT
AAAGCGTCCA TGTTTGTGCA CTTTCTCGAG GCGCGCCTGG GCAGACCCGA CTTTGATGCT
TTTTTGCGGT CCTATATCGA GCACTATGCC TTTGTGGCCA TCACCACCGA AGACTTTGTC
GAATATGCCA AAGGGACTTT GCTGCAAACC CACCCAGATA AGGTGACTGA GGCAGAGCTC
AGGGAATGGA TCTATGGCGA AGGCTTGCCA GCGACCTTTA TGCCGCCTAT GTCAGAGAGT
TTGGGGTGGG TGATAGAGTC CATGACAGAG TGGCTGGAAG GGCATCCTCT GACACCGGAG
CGCCTGTTTG GCTGGCGGGT TCAGCATTGG CAGTTCTTTT TAAATAACCT GCCGGAGCAG
ATTTCCCAGG AGCAACTGCT GGAGCTGGAT GAACGCTTTG CCCTCGGGTC ATCCGGCAAC
GCCGAAATTG CCTGCGATTG GTTCAGGGTA GCTATCCGTA ACCATTACGA CCCGGTACTG
GAGCAGGTCG AAGCATTTCT GTGCCGTATT GGTCGGGCCA AGTTTGTCCG CCCACTGTTT
TTGGAACTAC AGATAGCCGG TTATCGACAG GAGCTTGAAG CCATCTATCA CAGGGCCCGT
GAGAGCTATC ACCCCTCACT GAGGGTGCAA CTCGACCGGA TACTGTTTAA CGAGTAA
 
Protein sequence
MDFVMHQWDN RHDYHSFANT DSIRVTHLSL DLAIDFDTRC LQGCVRLDFV RKEGDAADVL 
VLDTRALAIK SITDVHGQPL DWGLGQASEI LGQALEIMLP NGITSVLVHY HTTEDAEGLQ
WLDGPQTQSG KPYLFSQSQP VNARSWIPLQ DTPKARVTFD ARVRANQPCR VVMSALNQAD
MPADGVFEFV MDKPMPTHLL AIAAGQIDRV PVSERSAVFA EPAMASLAAR EFEDIEAMMQ
MAESILGPYA WERYDMLILP PSFPFGGMEN PCLAFLTPTL IAGDKSLVST VAHELAHSWT
GNLVSNATWR DLWLNEGFTT YFTNRIVEAV YGREQAQLEL MLEYGRLKEE MAGMPLPRQT
LPANLQQDDP NAAFNRFTYD KASMFVHFLE ARLGRPDFDA FLRSYIEHYA FVAITTEDFV
EYAKGTLLQT HPDKVTEAEL REWIYGEGLP ATFMPPMSES LGWVIESMTE WLEGHPLTPE
RLFGWRVQHW QFFLNNLPEQ ISQEQLLELD ERFALGSSGN AEIACDWFRV AIRNHYDPVL
EQVEAFLCRI GRAKFVRPLF LELQIAGYRQ ELEAIYHRAR ESYHPSLRVQ LDRILFNE