Gene Sama_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1661 
Symbol 
ID4603912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2030305 
End bp2033127 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content53% 
IMG OID639781024 
ProductTonB dependent receptor-related protein 
Protein accessionYP_927537 
Protein GI119774797 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.55964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAT TTAAACCAAA CCTGCTGACC GTGGCGTTGC TCGCTGCCGG TTTCAGTATG 
CAGGTACTGG CAGCCGAACC TTCTGAAGAA AAAGATGTAA ACAAAGAAGA CGCTGCCAAT
ATCGAAGTGA TCACCGTAAA GGGCTTTCGC ACCAGCGTGA TCAAGTCGCT CAACGAAAAA
CGTTTTGGCG ATACAGTCTC TGAATCCATC TCTGCCGACG ATTTGGGCGC CCTGCCCGAT
CAGTCCATCG CCGACGCCCT CACCCGCTTG CCCGGCATCA CTGCCGTGCG TACCGGCGGC
CAGGCCAGCG GCCTGAACAT TCGTGGTTTG GATGGTGACT TTGTGTTTGC CACCCTGAAT
GGCCGCGAGC AGGTCACCAC CGGCAGCAAG CGTGCTATCG AATTTGACCA GTATCCTTCT
GAGCTCATCA GCCAGGCCGC TGTTTACAAG TCACCCAAGG CATCCCTGAT TGAAGGTGGC
GTGGCCGGTA CCGTTGAGCT GAAAACAGCC GATCCACTGC GCGCTGCAAA AGAGCAAAAC
TTTACCTTTA ATGCCCGCGG TGCCTTTAAC GATCGCGCCG ATGAAGTTGC CGATGCCAAC
GAGTACGGCA ATCGCTTAAG CTTCTCTTAT CAGGGCAAGT TCCTGGAAGA AACCCTGGGT
GTTGCCCTGG GTTACGCGCA CCTGTACCAG CCTTTTGTTG CCAGCCAGTT TATCGGTTTG
CGCTTCAACG ATGCCAAGAA AGATGTCAAC GGTGATGGCG TGGTTGAGTC CATCAGCGAA
GGCTTTGAAA TGCAGCAAAA AGGTGGCGAA GACACCCGCG ACGGTTACAT GGCTGCCATT
CACTGGCGCC CCAATGATAA CTGGTCTTTC AAAGGTGATT TGTTCCACTC ACAGTTCGAT
TCAGAAAACT TTGCCCGCGG TTTCCGGGTG AAGTCACTGC AAAACGGCAC CATCACCAAC
GCCAACATTA AAAATGGCTC CATGATTGGC GGCACCGTGA GCACCGATGG CACAGATAAC
TTTGCCCTGT TTGTGGTGAA CGATAACGAC TCCAAGTATT CAGAGCTGAC CTCTGGCTCT
TTCAACGTAG AGTGGAACAA CGGCGATGCC CTGACCGTTT CAGCCGATAT CAGTTACTCC
AAGGCCGATG GCGAGTTTGT GAACGGTGGT ACCCGTGCCG TTGTGTATCA GGATATCGAC
AATGAAATCC GCGCCGCCGA GTCCATCAGC TATCAGCTGA ACGATCTGAA CCCAGCCTCC
ATCTCGGTCA CCAACGACTA CACCGACCTG AGTACCCTGG GCCTGAAAGA AGTGGGCATG
TGGCCCTACA ACCAGCAAAA CGATCTGATT GCGTACAAGC TGGACTTCAA CTATCAGCTG
GATACCGCTT TTGTCTCTTC AGTTGAGTTT GGTGTGCGTT ACTCTGAGCG CGAATTCAAC
GCCCAACGTT CACAGGCAGG TTACGGTTTC GAGTTTGGCC ACAATCCGGC CAACCAACCC
GTACTGCGTC TGACCGACGA CATGACCAGC GTGGTTAACT TTGGCGGTGA ACTCGCCGGT
TACCCAAGCT TCCTTGCCAT CGACTTCGAC AAGGCCGTTG ATCTGGTGAA TGCACAGCTT
GCTGCCACAG GTCAGGATCC TTTTGCCCCT ACCGCCAACT GGTCCAATAA CTGGACCATG
ATCCAAAGCG GTGCGGTAAA CGAAGATGTA CTCGCCGGTT ATGTGCAGGC CAATCTGGAG
TTTGATCTGG CCGACGTGAA AGTGACTGGT AATCTGGGCC TTCGCGTAGT GCATACCGAT
CAGTCCAGTA CCGGTCTGCA GCAGGTGGGC TTTGGTCTGG GTGAAGCCAT CACCGACGAA
AAAGGCGTGG TGAGCACAGA TTATATCCGC AACGAAGTGG GTAAGACTTA CACCGACTAC
CTGCCCTCAT TGAACCTGAA CTTCCACCTG ACCGATAACG ATCAGCTGCG TTTTGCCGCC
GCCTCCGTGA TGGCACGCCC GCCTATCGAT AAGCTGAAGT CAGGCATGGG CAGCTGGTAT
GACGATGCCG CCACTCCCGG CTACAAGAAG TACAACGCAT GGGGCAACAC CAGCCCACTG
CTGGATCCCT TCTACGCCGA TCAGTTTGAC CTGTCTTACG AGCACTACTT TGAAGACAGC
GAAGGCGCCG TGGTCGTAGC CCTGTTCTAC AAAGACATCA AGTCTTTCAT TAACAACTTC
ACCATACAGC CATTCGATTT CGAAGCAGCC GGTTTCCTGG TGCCCGATAC CATTATCGAC
AATGGTGTTG AGTTCCCGGT GGTGAAAGAC GAAGGCCAAT ACCAAACCGC CATCAACAAC
GACAAGGGTG GCTACATCCG CGGCGTTGAG CTGGCTTACA CCCAGGTATT TGACTTCCTG
CCCGATCCTT TGAGTGGTCT GGGCTTCACC GGCAGCTACT CTTACTCTGA CAGTGAAGTG
CAGTTCACCA CCGATTTGAG CGGCTCATCT CTGGATATTC CGCTGCCAGG TTTGTCTGAG
CACGTGGTTA ACACCACACT CTTCTACACC CTGGACGGTT TCGATACCCG TTTGAGCATG
CGTTATCGCA GTGGTTATGT GTCTGAGCAG GTTGCGGTTG AAACCCAGCT GGCCTTCTTC
GATGCCGAAA CCATCTTCGA CTATCAGGCG TCTTACGCTC TGGATAACGG CCTTAAGTTC
CTGTTCCAGA TCAATAACCT GACCGACGAG CCCAACAAGA CTTACTTCGG TGAGGAGTAT
CAGACAGGGA CCATCCAGTC CTTTGGTCGC CAGTACTTCC TGGGTATGAG CTACTCAATG
TAA
 
Protein sequence
MMQFKPNLLT VALLAAGFSM QVLAAEPSEE KDVNKEDAAN IEVITVKGFR TSVIKSLNEK 
RFGDTVSESI SADDLGALPD QSIADALTRL PGITAVRTGG QASGLNIRGL DGDFVFATLN
GREQVTTGSK RAIEFDQYPS ELISQAAVYK SPKASLIEGG VAGTVELKTA DPLRAAKEQN
FTFNARGAFN DRADEVADAN EYGNRLSFSY QGKFLEETLG VALGYAHLYQ PFVASQFIGL
RFNDAKKDVN GDGVVESISE GFEMQQKGGE DTRDGYMAAI HWRPNDNWSF KGDLFHSQFD
SENFARGFRV KSLQNGTITN ANIKNGSMIG GTVSTDGTDN FALFVVNDND SKYSELTSGS
FNVEWNNGDA LTVSADISYS KADGEFVNGG TRAVVYQDID NEIRAAESIS YQLNDLNPAS
ISVTNDYTDL STLGLKEVGM WPYNQQNDLI AYKLDFNYQL DTAFVSSVEF GVRYSEREFN
AQRSQAGYGF EFGHNPANQP VLRLTDDMTS VVNFGGELAG YPSFLAIDFD KAVDLVNAQL
AATGQDPFAP TANWSNNWTM IQSGAVNEDV LAGYVQANLE FDLADVKVTG NLGLRVVHTD
QSSTGLQQVG FGLGEAITDE KGVVSTDYIR NEVGKTYTDY LPSLNLNFHL TDNDQLRFAA
ASVMARPPID KLKSGMGSWY DDAATPGYKK YNAWGNTSPL LDPFYADQFD LSYEHYFEDS
EGAVVVALFY KDIKSFINNF TIQPFDFEAA GFLVPDTIID NGVEFPVVKD EGQYQTAINN
DKGGYIRGVE LAYTQVFDFL PDPLSGLGFT GSYSYSDSEV QFTTDLSGSS LDIPLPGLSE
HVVNTTLFYT LDGFDTRLSM RYRSGYVSEQ VAVETQLAFF DAETIFDYQA SYALDNGLKF
LFQINNLTDE PNKTYFGEEY QTGTIQSFGR QYFLGMSYSM