Gene Shewana3_2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2085 
Symbol 
ID4476331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2497516 
End bp2500353 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content43% 
IMG OID639726670 
ProductTonB-dependent receptor 
Protein accessionYP_869721 
Protein GI117920529 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000059831 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000324721 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTTAAAC AGACTTATTT GGCAAGTGCT ATTCTGCTTG CTCTTGCTAG TCAAGCGACG 
TATGCGGCAG AAGCTGAAGC GACACAATCG CCTCAGACGG AAGAAACGTC ACGCCCCTCT
TCTGGCGGTA AAGCGCAAAA TAACGAAGAG ATGGAGATCA TTCAGGTTCA GGGGATCCGT
GGTAGTTTAA ATAAAGCGGT AGAACTTAAA CGCCAAAACA TTCAAGTTGT CGATGCCATT
ATTGCCGAAG ATATCGGTAA GTTCCCAGAT AATAACGTGG TTGAAGCCTT GCAACGGGTA
ACGGGTGTTC AAGTGACTGA CCGTGCATCA GGTGAAGCCA ATACCGTGAG TATTCGTGGT
TTAACCGACG TAACCACTAC AGTTAACGGC CGTCAAGTTT TCACTGCAGC AGGTCGTGAA
GTCGCTATTG CTGACGTACC TGCGGCATTA CTCGGCAGTG TTGAAGTCTT TAAAACCCGC
TCTTCTTCTC AAGTCGCCAG CGGTATCGCG GGTCAAATTG ACATTCGTAC CCATAGACCT
TTTGATTTCC AAGAAGAAAA AATTTCGGTA GCAGCCAAAG GGATTTACTC TGATCAGCCT
GACACTATCG ATCCCAATTT CAGCGCCTTA ATTAGCGACC GCTGGGATAC CAGCATCGGT
GAAGTCGGCG CATTAGTTAA CGTCTCCTAT ATCCGCACCA ACTATAATGA TCAAGTGGTT
GCACCTGGCG CATCAATGCC CTATTACGCT GAAACTGGTG TCCAAATCCC TGATACTTAT
TGGAAGCGTG GCCTAGCCCA TGGTTTAGAT ACCAGTGAAG GGGCATTAAT CAACGGTCAG
GAATATCTGC TCATGCGCGA CGCCGTATTC CAAAATATGA ATAACGGTGA GCGTGAACGT
CCAGCACTGA ACCTATCATT GCAATGGGCG CCAACTGACA CTTCCGAATA TCTTTTTGAA
GCATTCTATA ACGGTTATCG TAATGATAAC TTCAACTCTA TGCTCTTTAG TAACGTAGAT
TCATCCGCTA ACTGGAGCCA AGTGATTCAA GATGGTATTG AAGTTTATGA TGGTACCAAC
GTGGTAAAAT CGCGAACTGC CTATAACGTT GACGGATTTA GCAGTTCAGA CCATAGCCAT
AACAAAACAG ATAGTTATGT ATTTGCCTTA GGAGGTAAGT GGGACTTTGA TAATTTGACC
TTAAAGTCTG AAGTGGTCTA TCAAACCAGT ACCTATGAAT CACAATTTCT GGCGATGCGC
GGTGTAAGCG CGAATAGCAC TAAGACTTTC TACGGTGTTG ATGTTGACTA TAACAACAAA
AGTGGTGGTG TACCTAGTTG GGTTTATTTA GATAATCCAG ATACAGATAT TAATGAGTCT
GATTTAACTC AATATGCCCT ATGGCAAACG GCTCAATTAT ACGATAGTGG CGCTAAGGAC
GAAGGTGATT CAGTTACTTG GACATTCGAT GGTGACTATT TCCTCGACTA CGGCATATTT
ACTAAGATGA AATTTGGTAT TCGTGCAGAA CAACGTGGTG CGAAACACGG ACTTTACGAT
GCCGGTTCCT TAAACACCAA CGTGTTATTT AGCGATCTTG ACCCTAACAT GTTCACCGTA
ACCTCCGGCT TCTTTGATGG CCGTGGTAAT GTGCCGACAA GTTGGGCGAT TGCCGATGGT
AATTATTTAT ATGCTAATCG CTCAGCTATT GAAGCTATGT ATAACGCCGA TGAGCAAGCC
CTTGCCATGT ATACCAACTA TGACATCACC GAAAAGTCAT ATGCTGCATA TATTCAAAGT
GATTTTGAAA CCGAGTTATG GGGTAAGCAA GTTGATGGTC AAATCGGCCT TCGCTATGAG
AAAGCCGATG CAGATATGGA TTTCTGGGAA CGCAAAAAAT ATGCTGCAGA GGATGTTGTT
GATGACATTC GATACTATCT CGTTCATGAC ACAGATACCA ATGGAAGTGC GGAGTTATTA
CCAAGCTTAA TGGTACGCTT CTGGTTAACT GACGACTTGG TTGCACGTTT TGCCTACACT
GAAACGATTC GTCGTCCCGC CTTTGGCGAT TTAACCACTG CGATTTCTTA TACAGAGGAC
TTAACAAAGC TTGGGTTCGG TCAAGCTTCA AGTGGTAACT CTAAGTTAGA ACCTACTACT
TCACAAAACT ATGATTTCTC ATTGGAATAT TACTTTGGTG AAGGATCTTC CATATTCGGT
ACTTATTTCC GTCGTGATAT TGAAGGTTTC GTATTTAACT CTTCTTCTAC TATTACCCGT
GAAGTTGACG GCGAAACCAA AAAATATATT TTAAGCCGTC CAGAAAACGC TTCAAACGGA
AAGCTGACAG GATTTGAGCT AGGTACTGTA TATTTCCCAG AGAATATGCC TAGTTACTTA
GATGGCTTAG GTACTCAAGT CAGTGCAACC TTCTTAGATT CAAGCCAAGA CATTCCTGAG
TTTGACTCTG AGAGTGGTGA GCAAATTGGT ACTACAACTC GTGAACTATT CGGTGTGTCT
GATACTTCTA TGAGTGCGGT ATTGATTTAC GATAAAGACG ATTACAGCGC GCGTTTATCA
TACACTTGGC GTGATAAATT CTTAAGTGCG TATGATGCAG GTAGCTTTGC AATGCCACGC
GGTATTTACC GTAAACCTGA GCAGTCACTT GACTTCCAGT TCAGCTACAA CATCAGTGAT
GAGTTTGTAT TAACCTTCGA TGCGACAAAT ATTCTTGATG ATATCTATCA AGAATACTAC
GAGGACTCAG TACTCTATAA CCGTACGAAC AGTATCTATA CTCGTACTTT TGCATTGGGT
GCGCGTTACT CCTTCTAA
 
Protein sequence
MFKQTYLASA ILLALASQAT YAAEAEATQS PQTEETSRPS SGGKAQNNEE MEIIQVQGIR 
GSLNKAVELK RQNIQVVDAI IAEDIGKFPD NNVVEALQRV TGVQVTDRAS GEANTVSIRG
LTDVTTTVNG RQVFTAAGRE VAIADVPAAL LGSVEVFKTR SSSQVASGIA GQIDIRTHRP
FDFQEEKISV AAKGIYSDQP DTIDPNFSAL ISDRWDTSIG EVGALVNVSY IRTNYNDQVV
APGASMPYYA ETGVQIPDTY WKRGLAHGLD TSEGALINGQ EYLLMRDAVF QNMNNGERER
PALNLSLQWA PTDTSEYLFE AFYNGYRNDN FNSMLFSNVD SSANWSQVIQ DGIEVYDGTN
VVKSRTAYNV DGFSSSDHSH NKTDSYVFAL GGKWDFDNLT LKSEVVYQTS TYESQFLAMR
GVSANSTKTF YGVDVDYNNK SGGVPSWVYL DNPDTDINES DLTQYALWQT AQLYDSGAKD
EGDSVTWTFD GDYFLDYGIF TKMKFGIRAE QRGAKHGLYD AGSLNTNVLF SDLDPNMFTV
TSGFFDGRGN VPTSWAIADG NYLYANRSAI EAMYNADEQA LAMYTNYDIT EKSYAAYIQS
DFETELWGKQ VDGQIGLRYE KADADMDFWE RKKYAAEDVV DDIRYYLVHD TDTNGSAELL
PSLMVRFWLT DDLVARFAYT ETIRRPAFGD LTTAISYTED LTKLGFGQAS SGNSKLEPTT
SQNYDFSLEY YFGEGSSIFG TYFRRDIEGF VFNSSSTITR EVDGETKKYI LSRPENASNG
KLTGFELGTV YFPENMPSYL DGLGTQVSAT FLDSSQDIPE FDSESGEQIG TTTRELFGVS
DTSMSAVLIY DKDDYSARLS YTWRDKFLSA YDAGSFAMPR GIYRKPEQSL DFQFSYNISD
EFVLTFDATN ILDDIYQEYY EDSVLYNRTN SIYTRTFALG ARYSF