Gene Sama_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1749 
Symbol 
ID4605966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2139025 
End bp2141085 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content54% 
IMG OID639781113 
Producthypothetical protein 
Protein accessionYP_927624 
Protein GI119774884 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.624168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0119048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG CTACCCATAA TCCAAATTGT TTTAACCAAA GCAACAATAC CCCATTTGCC 
GAGGTGCTGG ACCGTCATCT TTCTCGCCGC AACTTTGTAA AAAGCGGTCT GGGCCTTGGC
GCCATGACCG CATTTGCGGG TTTGGGCCTC GCAGGCTGTG GCAGTGATTC AAGCCCGGTA
ACGCCTCCAG TGACCACACC AACACCACTC CCTCCACCAA CCCAGTCAGA CATCACTCTG
GGCTTTGACT CCATTCCCGG TTCGCTCACC GATGGCGTTT CCATTCCCCA AGGCTACAGA
GCTCAGGTGC TGGCACCCTG GGGAACCCCG TTGAATGATA AGGCTGCCCC CTGGAAAGAC
GATGGCAGTA ATACGTCGGA CGATCAGGCC AATGCAGTGG GGCAGAACCA CGACGGTATG
CACTTTTTCC CACTGAATGA TGCCGCCGAC GATGGTCTGC TGTGCATCAA CCATGAATAC
ATTGAACCAG ATGCACTGCA TCCAACTGGC CCTAGTGTTG ACCCGGTCAC GGGCCTTCGC
ACCATCCTTG ATGAAGTGCG CAAGGAAATC AACGCTCACG GCGTGACTGT GGTACGTATC
AAACGCACCA ATGGCGTATG GGAAGTCATT AAAAACGATC CCCATAACCG CCGTTTTACC
GGTGCTACCA CATTTGATAT GTCAGGCCCT GTTGCATACA GCGATCATCT GGTTACCGCT
TTCTCGCCCG ATGGCAGTCA GACCCGAGGT ACCCTGAACA ATTGCGGTAA CGGTTCAACC
CCTTGGGGCA CCTACCTGAC CTGTGAAGAA AACTGGCCAG AATACTTCGT CAACAAAGGC
GAAATGTCTG CCGCCAACGC CCGTATCGGT GTTGCAACCA AGGACACCCG CTACGGATGG
AACCACTTTG CCGGCCACGA TGAAGAGCGG GCAGATGAAT TCAGCCGCTT CGACATCACC
CCAACCGGAA TAAGTGCCCT CTACGATTAC CGCAATGAAG CCAATGGTCA CGGCTATATC
GTTGAAATCG ACCCCTACAG TCCTAATCAG CGCGGTATCA AACGTACGGC CCTGGGTCGC
TTCCGCCATG AAGGGTGCAC CTTCGGTAAA CTCGAAGAGG GTAAGCCCGT TGTGTTCTAC
TCAGGGCACG ATTCCCGCTT TGAGTATCTT TACAAGTACG TATCAGATGC ACTTTGGTCT
GAGGCCGATG CCAACAGCAG CAACCGCATT GCTACCGGTG ATAAGTACAT GAATGACGGT
ACCCTCTACG TCGCACGCTT CGATGAGGAT GGCCAGGGCG AGTGGTTGCC ATTGGTGCCG
GGAACTGTCA CCACCGATGG CAAAACCCTG GGTGAGCATT TGGGGACACT TGCAGACATC
ATAGTCAACA CTGCAGGAGC AGCCGATCTG GTGGGGGCGA CCCCAATGGA TCGCCCAGAA
TGGTGTGCTG TCGACCCCTT CACCGGCAGT GTGTACCTGA CGCTGACAAA CAATACCCGA
CGCACCGAAG CCAATCCGGC CAACCCCCGC TTGAAAAATA GCTTTGGTCA CATCATTCGT
TGGAATGAAG GCGCCTCTGA TACCGAATTC AGTTGGGATA TCTTTGTCTT TGGTTCTCCG
GCCAACGGTG ACAGTGAAAC CAATCTCAGT GGCCTGACCG ATGAGAACCA GTTTGCAAGC
CCGGACGGCC TGGCATTTGA CCCACGTGGC ATCATGTGGG TACAAACCGA CAACGGAGCC
AAGGAAGTGA CTGAACACAC CAACGACCAG ATGCTGGCTG TGGTGCCCTC CAGGTTGCTC
GACGCAGAGG GCAATCAAAA ACCACTTCGC GCCGATAACC AAATGGAACT GCGCCGTTTC
TTCGTGGGTC CCAACGACTG TGAAGTGACA GGTGTAGCCT TCAGCCCTGA TTATCAAAGC
CTGTTTGCCA ATATCCAGCA CCCCGGCAAC TGGCCTTACA GTGACGATGC TACCCAGATT
ACCCCGGCCG GTATTAGCAT CAGGCCAAGG GCAGCCACTG TGGTTATCTC AAAAGTTGAT
GGCACCGAAG TGGGCGTGTA A
 
Protein sequence
MSKATHNPNC FNQSNNTPFA EVLDRHLSRR NFVKSGLGLG AMTAFAGLGL AGCGSDSSPV 
TPPVTTPTPL PPPTQSDITL GFDSIPGSLT DGVSIPQGYR AQVLAPWGTP LNDKAAPWKD
DGSNTSDDQA NAVGQNHDGM HFFPLNDAAD DGLLCINHEY IEPDALHPTG PSVDPVTGLR
TILDEVRKEI NAHGVTVVRI KRTNGVWEVI KNDPHNRRFT GATTFDMSGP VAYSDHLVTA
FSPDGSQTRG TLNNCGNGST PWGTYLTCEE NWPEYFVNKG EMSAANARIG VATKDTRYGW
NHFAGHDEER ADEFSRFDIT PTGISALYDY RNEANGHGYI VEIDPYSPNQ RGIKRTALGR
FRHEGCTFGK LEEGKPVVFY SGHDSRFEYL YKYVSDALWS EADANSSNRI ATGDKYMNDG
TLYVARFDED GQGEWLPLVP GTVTTDGKTL GEHLGTLADI IVNTAGAADL VGATPMDRPE
WCAVDPFTGS VYLTLTNNTR RTEANPANPR LKNSFGHIIR WNEGASDTEF SWDIFVFGSP
ANGDSETNLS GLTDENQFAS PDGLAFDPRG IMWVQTDNGA KEVTEHTNDQ MLAVVPSRLL
DAEGNQKPLR ADNQMELRRF FVGPNDCEVT GVAFSPDYQS LFANIQHPGN WPYSDDATQI
TPAGISIRPR AATVVISKVD GTEVGV