Gene Sama_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2025 
Symbol 
ID4604275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2460445 
End bp2462355 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content46% 
IMG OID639781402 
Productprolyl oligopeptidase family protein 
Protein accessionYP_927900 
Protein GI119775160 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00251963 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT TTCTGTTAAG CCTTACCGCA GTTACCTGCA TGGCATCACA GCCACTCTAT 
GCGGAGCCAG TGACTGCGCA GTCACAAGCC GTGAGCACTA TTGAAGCCTT TTCTACTGCG
CCTCTTATTC GCAGTGTTGC TGTTTCAAGC GACGGAACCA AAGTAGCCAT CATCCGGGCC
ACGTCGAAGG AAGGTGACTA TGTGATTGAA ATCCGCGACA CGAGCAAACT CGATGCCCAA
CCAATCGTTT TGGGAGCAAA TCGTATGATG GTTTCGGGTG TCGTGTGGCT TAACAATGAC
AAGATTGGCG TAATGTTCCG TCAGATCTTG AAAGATGGTG CCAGAAGGTA TTGGGTAAAC
CGATTTGCAA TTACCGATGC AGATGGGAAA GGAGATTGGT TAATTCCGTT TCAAAACAAT
CCCCGCGCAG GTTTTTCTAT CCTGAATACC TTGCCTGATA ACAAGGATGA AATCCTGGTT
GAAACCGACC TGAACAACAA CTACAAGCCT GATGTTGTCC GTTTTAACGT AAATAATGGT
CGCACAGTCT CAGTTCTTCG AGGCAGTGAC AAAATTTCTG GTGGTTATAT CTCAGATGCA
GATGGTGATG TACGCGCTGC GACAGGCTGG AATTTGTCAG ACAATGCCAT CGACCTTTAT
GCCAGAGCGA AGGGCGATAG TGACTGGAAG CTCGTGAAAC AGATTTCACC CAAATCCAGA
GAAAGTTACA GCTTTATCGG CTTTTCAAAG GAAAATCCCG ATGAAATTTA TGTCAACGCC
AACCAAGGGC AGGACAAAAC CGGAATCTAC CTTTACAACA TTAAAACGGG CCAGTATTCC
GAACGTCTAT TCGGTTTGGA AAACGTCGAT GCGGATGGCG TGATCCAGAA AAAAGACGGT
AGCTTGGTTG GATTTGGTTA TTCAGCAAAA TGGCCTCAGC GCTATCTTAC CGATGCCAAT
GAACAAGCAA TTTATGATGG ATTGGATAAA CTGTTTGAAG GCAAGTTTGT TTCCATTATT
AGCCGTTCTA ATGATGACTC AGTCATGGTG GTCAGTGCGT CATCCGACAA GGATCCTGGG
ACTTATTACC TTGTTTTAAA CAAAAGTAAA ATTGAAACCA TCGGCAGTAA GTTGCCCTTT
GTCGACCAAG AAAAACTTGG TAGCGTCAAA TACATCTCTT ACAAAGCCCG TGATGGCCGT
AAAGTCTACG CATATGTCAC TCTTCCAAAA GGAAAAGGAC CGTTTCCTGC GGTAGTATTG
CCACATGGTG GCCCTTGGGT TCGAGACACA ATCGTATTTG ACGATTGGGC GCAATTACTG
GCTTCCAATG GTTATGTTGT TATCCAGCCC AACTACCGAG GTTCAACCGG TTATGGTATC
GAGCACTGGA CTGCGGGTGA CAACAATTGG GGCCTGAAAA TGCAGGACGA CCTTGATGAT
GCTGCCATGT ATCTGGTTGA AAAAGGACTG GCCACCAAGG ACAAGTTGGC CATGTTTGGT
TGGAGTTATG GTGGTTACGC CGCTTTCGCC GCGTCAATGC GCGATAATAA CATTTACCAG
TGTACTGTTG CCGGAGCGGG TGTCAGCGAT TTAAGTAAAA TCAATGCAAC CTTGAACGAA
AATCGGTTCT TAAGCCGTCT GCAACGTCCA ACAATCACGG GTGTTTCACC TCTTTCACAG
GTTGAGAAAG TCAATGTGCC AATACTCGTG GTACACGGTG ATATCGATGG ACGTGTGCCA
GTTGCCCACA GCCGTGAGTT TGTAGAAAAA CTCAAGGATT TGAAAAAGGA TCATAAATAC
GTTGAGCTGG TGGACGCTGA TCACTTTTCA GACACTTTGT TTTATGAACA TAAAATGGCG
TTTTACTCAG AACTTATTGA TTGGTTAGAC AATAAGTGCG GTCTGAAGTA A
 
Protein sequence
MKKFLLSLTA VTCMASQPLY AEPVTAQSQA VSTIEAFSTA PLIRSVAVSS DGTKVAIIRA 
TSKEGDYVIE IRDTSKLDAQ PIVLGANRMM VSGVVWLNND KIGVMFRQIL KDGARRYWVN
RFAITDADGK GDWLIPFQNN PRAGFSILNT LPDNKDEILV ETDLNNNYKP DVVRFNVNNG
RTVSVLRGSD KISGGYISDA DGDVRAATGW NLSDNAIDLY ARAKGDSDWK LVKQISPKSR
ESYSFIGFSK ENPDEIYVNA NQGQDKTGIY LYNIKTGQYS ERLFGLENVD ADGVIQKKDG
SLVGFGYSAK WPQRYLTDAN EQAIYDGLDK LFEGKFVSII SRSNDDSVMV VSASSDKDPG
TYYLVLNKSK IETIGSKLPF VDQEKLGSVK YISYKARDGR KVYAYVTLPK GKGPFPAVVL
PHGGPWVRDT IVFDDWAQLL ASNGYVVIQP NYRGSTGYGI EHWTAGDNNW GLKMQDDLDD
AAMYLVEKGL ATKDKLAMFG WSYGGYAAFA ASMRDNNIYQ CTVAGAGVSD LSKINATLNE
NRFLSRLQRP TITGVSPLSQ VEKVNVPILV VHGDIDGRVP VAHSREFVEK LKDLKKDHKY
VELVDADHFS DTLFYEHKMA FYSELIDWLD NKCGLK