Gene Sama_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2003 
Symbol 
ID4604253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2432403 
End bp2434664 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content58% 
IMG OID639781380 
Productinter-alpha-trypsin inhibitor domain-containing protein 
Protein accessionYP_927878 
Protein GI119775138 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0804312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.260211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGATG ATAAGAGTAT GTATAGCCAA AGGAAGGTCG TGCCCAAGGT GGAGATGAAG 
CAAGGCCTGA CTGCACTGGC AATGATGGCG TTGCTGGCAG CGCCAACAGT CAGCGCGAGC
TCAGCCAGCA CAGTGCAGCC CGTAACCTTG CCACTCGATG GTCCAATGGC TGGCCAGGGC
AGTACTTCGC CAGGCGACAT CATAGCACCG GGCCATGGCG GGGAGCTGAT GTTTGAAGAC
GAGGGCACGG AGCAAGTCAG TCTTGCCCTC GATACCCGCT ACACGGTTAA GGTCAGTGCC
TTGGTCAGCC GGGTTACAGT TTCTCAAACC TTTCAAAATC AGAGTAATAG AGTGATAAAC
GGCCACTACC GCTTTGCGCT GGCGCCCAAT GCGGCGGTGA GTGGCATGCG CCTTACAATC
GGCGAACGCC TGATTGAAGG AGAGATCCAG GAAAAAGCTC ACGCAGAGCG GGTATACGAG
CAAGCAAAAC GGGATGGTAA ACGGGCGAGC CTGGTCAGCC ACAGCGAAAG CAATCTTTTT
TCGACACGGA TTGCCAATTT CATGCCCGGC GAAACCCTTA CCGTCAGTAT CGATTATCAG
GAGATTCTGC ACCCACAAGC GGGCCGGGTT GAGCTCAGGA TCCCCACTGC ACAAACGCCA
AGGTATGGCG CCTCAGCAAT GCTGCCTTCG GCGCAGAACA GGATGCAGGA TGAGCCTTGG
GTATCCGGGG TGCAGGGCGC GGCGCAGAAC CATCTTCATG CCGCTGCGCC TTTTTGGCCC
GCCACCCGGG CAGAGGCTCC GGCAGACACG CCAACACCCT CGCTGTCACT CAGCGCCAGT
ATTTTTGGCT TTGCCGTGGC GGCGGTCGAA AGCCCATCCC ATGGCTTGTG TGAGCAAGGC
TATCGCGATG GTGGCTGGCA GTTATCGCTT TGCCCCGACA CCGAGGCTGA CAGGGATTTG
GTGCTGTCCT GGGTGATTAA CGAAGGCAGT GAGCCGGTTG CAGACTTTTT GGTGCAGCCG
GGCTACAGCT ATCCCGCCGC TGTGGAAAGC GCAAACACTC AGTATGGAAC TGCTCAGGAT
AAAGCCAAAC AGGATGAATA CGTCCGGGTA GATTTTTCCC AGGGGAATTA CAGCCACGGT
TTGCTGACCT TTATGCCACC TCAACCCAAT TTGGCCAATC GCCTGGCGCG GGAGCTGGTA
TTGGTGATCG ATACCTCAGG TTCCATGGCC GGAGACTCCA TGGTGCAGGC AAGAAGTGCC
CTTATCCACG CGCTCGGCGG TTTGGGGCCA CAGGACAGCT TCAATATTAT TGCTTTTTCC
AGTGATGCCA GGCCGTTGTG GCCGGACGCA AAACCGGCAA CGGCATTTAA TCTCGGGGCA
GCCCAGCAAT TTGTGCGCAG CCTTGAGGCC GATGGCGGCA CTGAGATGGC ATCTGCGTTG
GAGCTGGCCC TTAAAACGCC TTCAGTTGTA GATGAAGACA CCAAACGGTT GCGACAGGTG
CTTTTTATTA CCGATGGCGC CGTTAATGGC GAAGATGCGC TGTTTAATCT GATTGAGCGG
CGTCTGGGCA CATCGCGGCT TTTCCCCGTG GCTATCGGCG CCGCGCCAAA CGGGTATTTC
ATGAGCCGGG CGGCTGCAGC AGGTCGTGGC AGCTTTACCT TTATCGGTCA TGGCGGCGAA
GTGGCAGAAA AAATGAATCA GCTCTTGAGT CGCATTGAGC ACCCCGTGGT TAGCGATCTG
TCCGTGACCT GGGCTGACGG TAGCCCCGTT GATGCCATGC CTGGGGTATT GCCCGACTTG
TATGCGGGCG AGGCGCTTAA CCTGAGCCTC AGAACGGTCC CCGATGCGCT GATGCCCATC
ATAGTCAAAG GTAATACGGA CGGCCAAATC TGGGAGCGCA AGCTTACCCC ACGTGCCGTG
CCGGGCGGCA GCGGCCTTGA CCTGCAATAT GGCAAAGCCC GGGTTGATGA CCTTGTCCGA
CAAACATTAA CACCTTCCCA GCGGCGTGCA GCCACTGTGG CGTTGGGACT GGAATATCAC
CTTGTCACAG CCCACACCAG CCTGGTTGCC GTGGATAAAA CCCGGGTCTC ACACGGCCAG
GGACTGGATG CCAGGTTGCC GGAGGCAACG CCCCATGGCT GGCAAGGCGG CAGATTGCCC
CAAACCGCGT CGGGTAGCCT GGGATTGATG TTGGCCGGCG CTTTGCTGGT GCTGTTTGCT
ATCGCAGCGG CAACGTGGAG GCGGGAAGAT GAAGCGGCTT AG
 
Protein sequence
MKDDKSMYSQ RKVVPKVEMK QGLTALAMMA LLAAPTVSAS SASTVQPVTL PLDGPMAGQG 
STSPGDIIAP GHGGELMFED EGTEQVSLAL DTRYTVKVSA LVSRVTVSQT FQNQSNRVIN
GHYRFALAPN AAVSGMRLTI GERLIEGEIQ EKAHAERVYE QAKRDGKRAS LVSHSESNLF
STRIANFMPG ETLTVSIDYQ EILHPQAGRV ELRIPTAQTP RYGASAMLPS AQNRMQDEPW
VSGVQGAAQN HLHAAAPFWP ATRAEAPADT PTPSLSLSAS IFGFAVAAVE SPSHGLCEQG
YRDGGWQLSL CPDTEADRDL VLSWVINEGS EPVADFLVQP GYSYPAAVES ANTQYGTAQD
KAKQDEYVRV DFSQGNYSHG LLTFMPPQPN LANRLARELV LVIDTSGSMA GDSMVQARSA
LIHALGGLGP QDSFNIIAFS SDARPLWPDA KPATAFNLGA AQQFVRSLEA DGGTEMASAL
ELALKTPSVV DEDTKRLRQV LFITDGAVNG EDALFNLIER RLGTSRLFPV AIGAAPNGYF
MSRAAAAGRG SFTFIGHGGE VAEKMNQLLS RIEHPVVSDL SVTWADGSPV DAMPGVLPDL
YAGEALNLSL RTVPDALMPI IVKGNTDGQI WERKLTPRAV PGGSGLDLQY GKARVDDLVR
QTLTPSQRRA ATVALGLEYH LVTAHTSLVA VDKTRVSHGQ GLDARLPEAT PHGWQGGRLP
QTASGSLGLM LAGALLVLFA IAAATWRRED EAA