Gene Shewana3_2207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2207 
Symbol 
ID4478603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2649216 
End bp2652050 
Gene Length2835 bp 
Protein Length944 aa 
Translation table11 
GC content52% 
IMG OID639726802 
ProductTonB-dependent receptor 
Protein accessionYP_869842 
Protein GI117920650 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCTC GTACCCCACC ATTACGACAA GAGTTACAAC AAGGCTCCCA AGCGCCGCAA 
AAGCAGCACG CCATTCCTAA AACCCGATTG GCTGCACGTC TGTCGGCAAT AAGCTTAGCC
ATGATGGTCG CAGGCTTTAG CACTAGCGCC CTTGCCGTTG GTAAGCTTGA AGGGCAAATT
CGCGATAATG TCAGTCAACA ACCCCTTGCT GGCGCGACCG TCACCTTAAA AGAGCTCAAC
CTGAGTCAAC AGGCAGGCCG CGATGGCCGC TTCTTCTTTG TCGGCGTGCA AGATGGCGAT
TACACCTTAG TAGTCAACTA CCTCGGTGCT ATGCCGCTGG AACATAGCGT TAGCATTCGC
GATAAACAAA CCACGCTGCA AGATATCAAC TTAAGCAGCC AAGATGTCGA GCATATTCGT
GTCGTCGGCC AACAAGGCGC ATTAAGTAAA TCCATGAACC GCCAACGCGG CGCCGACAAT
GTACTGAGTG TGGTCAGCGC CGATGTGCTG GGGAATTTCC CCGACAGCAA TATCAGTGAG
TCGCTGCAGC GGGTGCCAGG GCTATCTATC GAGCGAGATC AGGGCGAAGG TCGTTTTGTG
CGTGTACGCG GTATGGCGCC GGACTATAAC TCAGTGTCGA TGAACGGCAC GCGCTTGCCG
TCACCCGAGA GCGATCGCCG TGCGGTTGCC CTCGATGTAG TGCCATCGGA TCTCTTGCAG
TCGGTAGAAG TGAGTAAAAC TCTAACGCCG GATATGGATG CCGATGCACT GGGTGGCGCC
ATTGAGGTTA AAAGTCTGTC GGCCTTCGAT CGCGATGATA CTTATCTCAA CCTCAACGCC
GAGGCGAGCC AAGATACCTT AACCGACAAT ACCAATCCCA AACTCGCCGC CAGCTACAGC
GATATCTTTG CCGACAAGCT AGGGGTAGCC ATAGGTGCCA GTTGGTATAA CCGCGACTTT
GGCTCAGATA ACGTCGAAAC CGGCGGTAAA TGGGAATTTG CCGGGGATAA CGGCTTGGAA
GATGCGGCGC TTGAATCCAT CGACGCGCGG GATTATGAAA TCAATCGGGA ACGTTTAGGC
ATAGGCGTGA ACTTCGATTA TCGCCCAAGT GATGATACCG ATCTGTATCT GCGCACCCTT
TACAGTGAGT TTGATGATAC CGAAACCCGT AACAGCGCCA AAACTAAGTG GAAATCGCCG
CAGCAAGCCA ATGCCCTCAG CCAAGGCAAA ACCACTCGCT CGCTAAAATC ACGCACCGAG
AACCAGAACA TCACCTCCTT TGTACTGGGC GGTCAAACCC GCTTCGAACG CTGGACCTTC
GACTATCAAG CGAGCCACAG CACCGCCAGC GCCGAAAAAC CGCGGGATAT CGCTGGCGCC
GACTTTGTCG CCAAGATTGA TAACACGGGC TTTAGCAATA CCAAACAGCC ACAGATTATC
GCCCCCGAAG ACTACTTTCA AAACGCTAAC TTTGAGCTAG ATGAGATTGA AATTGCCGCC
TCCAAGGCCG AAGACACCAT CAACAGTGGC CAACTGGATC TCACCCGCCA GTTAACCCTC
GCGGATTACA GTGTCGAGCT AAAAACCGGG GTCAAGCTGA GTCGCCGCGA TAAATCCAAT
CGCGAAGATA TCTGGATCTA CAGCGACTTA GGCGATCAAG GCGTGAGCGA TGACGATTTG
CTATTGAGTC AGTATGCGGG CAATGAGCTT GACTATGACC TCGGTCGCTT TGGCAGCGGG
ATCAATGCTG CGCCGTTATG GCAGCTTATC GATAGCCTCG ATGCCGACAG CAATCGCGAT
GATATCGAGT CCACCATTAA CGATTTTGAT ATCAGCGAAG ATATCAACGC CGCCTACCTG
ATGGGCCACA TCGATATCGA TAAACTGCGT ATCTTAACTG GGCTGCGTTT CGAGCAAAAT
CAATGGGATT CCAGTGGCTA TGGTTATGAT GGTGCCAAGG GCGAATTTAT CGACATCAAG
CACTCCCGCG ATGAGGATCA TTGGCTGCCC GCACTGCACC TCACTTACCG CTATAGCGAC
AATACCGTGC TGCGCGCTGC CTGGACCAAC ACCTTAGTCC GCCCGACGTT CGGCCAATTA
GCGCCGGGAT ATTTGCTCGA AGAGGATGAT GGCGATATCG ATTTAACCTT TGGCAATCCA
CAGCTGAAGT CGCTCGAATC GATGAATTTT GACTTAAGCC TAGAGCACTA CTTCGGCAAT
ATCGGCTTAA TTTCGGCGGG GTTGTTTTAC AAAGATATCG ACAACTTTAT TTATCAGGCG
GATTTAGCTG GCCGTGGCGA TTATGTGGAT GCCCATAGCG CCGTGACCTT CGTCAATGGC
GACAGCGCCG ACATCTACGG CGTGGAACTC AGCTATGTGC AAGAGTTTAA CTTTTTGCCC
GAGCCCTTTA ATGCACTGGT GCTCAACTCT AACCTTACCT ACACGGATTC CAGCGCCAAG
ATCAGTTGGC TGGAGGATGG CCAGTTACTG AGCCGCGATA TTCCGATGCC AAGTCAATCG
GATCTCACCG CTAACCTATC ACTTGGCTAT GAAAACAGTT ACGCCAGTGT CTGGTTATCG
GCGGCCTATA AATCCGAATA TTTACAGGAA GTCACTGAGC TCAGTGATGA GCGCTACGAT
CTCTATCAAG ACAATCACTT GCAGTGGGAT TTTGTCGCCA AGGCCCATTT AACCAGCAAT
TTAACCTTGT ATTTCAAAGG GGTGAATCTG ACCGACGAGC CCTACTACAG CTACACGGGT
GACAGTGCTT ACAACGCCCA ATACGAAGCC TATGGCCGCA CTTTCCAGTT AGGCGTGCAG
TACACCAACT ATTAA
 
Protein sequence
MQSRTPPLRQ ELQQGSQAPQ KQHAIPKTRL AARLSAISLA MMVAGFSTSA LAVGKLEGQI 
RDNVSQQPLA GATVTLKELN LSQQAGRDGR FFFVGVQDGD YTLVVNYLGA MPLEHSVSIR
DKQTTLQDIN LSSQDVEHIR VVGQQGALSK SMNRQRGADN VLSVVSADVL GNFPDSNISE
SLQRVPGLSI ERDQGEGRFV RVRGMAPDYN SVSMNGTRLP SPESDRRAVA LDVVPSDLLQ
SVEVSKTLTP DMDADALGGA IEVKSLSAFD RDDTYLNLNA EASQDTLTDN TNPKLAASYS
DIFADKLGVA IGASWYNRDF GSDNVETGGK WEFAGDNGLE DAALESIDAR DYEINRERLG
IGVNFDYRPS DDTDLYLRTL YSEFDDTETR NSAKTKWKSP QQANALSQGK TTRSLKSRTE
NQNITSFVLG GQTRFERWTF DYQASHSTAS AEKPRDIAGA DFVAKIDNTG FSNTKQPQII
APEDYFQNAN FELDEIEIAA SKAEDTINSG QLDLTRQLTL ADYSVELKTG VKLSRRDKSN
REDIWIYSDL GDQGVSDDDL LLSQYAGNEL DYDLGRFGSG INAAPLWQLI DSLDADSNRD
DIESTINDFD ISEDINAAYL MGHIDIDKLR ILTGLRFEQN QWDSSGYGYD GAKGEFIDIK
HSRDEDHWLP ALHLTYRYSD NTVLRAAWTN TLVRPTFGQL APGYLLEEDD GDIDLTFGNP
QLKSLESMNF DLSLEHYFGN IGLISAGLFY KDIDNFIYQA DLAGRGDYVD AHSAVTFVNG
DSADIYGVEL SYVQEFNFLP EPFNALVLNS NLTYTDSSAK ISWLEDGQLL SRDIPMPSQS
DLTANLSLGY ENSYASVWLS AAYKSEYLQE VTELSDERYD LYQDNHLQWD FVAKAHLTSN
LTLYFKGVNL TDEPYYSYTG DSAYNAQYEA YGRTFQLGVQ YTNY