Gene Shewana3_1845 details

Gene Information

Locus tag: Shewana3_1845
Symbol:
ID: 4477904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2184010
End bp: 2187216
Gene Length: 3207 bp
Protein Length: 1068 aa
Translation table: 11
GC content: 52%
IMG OID: 639726427
Product: hypothetical protein
Protein accession: YP_869482
Protein GI: 117920290
COG category:
COG ID:
TIGRFAM ID:
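
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length.

```python
# Values copied from the gene record above.
start_bp = 2184010
end_bp = 2187216

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which codes for no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 3207 bp
print(protein_length, "aa")  # 1068 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.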


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.725746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.580829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAACA CGCCGACATC GCCAAATCAG GAGACTGCCG AGCACTCGCA TCAGGCGATG 
CCTCAAGAGG CGACTGCGGC CGCAGCGCCA CATTCGGCGG CGTCTCGCCG CGGTTCTACA
CTCAAAAAGC GCCTAAAACA AAGCCTGATT GCCACCACAG TGGCCGGCCT GTTAGCTGGC
GCGGCGCTAA CCATTTGGAT AGTCAACAGC CAAACCGAAG CCCTGATCCT TCGTGTCGCA
AATTACGCAT TAAGCGGTAT GGACGGTGAG CTTAGCGATA TTCGTTTAGG CCCCATGGGT
TTAGAACATT GGCACATTCG CTCTGCGAGC CTGCGTGTAC ATGACTCGCA TTTAGTGATT
AATAATCTTG ATATTCAGCT TGAACTCAAC TGGCCCAAGA GCCTTGAAGA ACTGAAGCAA
CTTGTCCAAG TTGAGAGTTT GACTCAAAAA ATCAAGCGTA TTAGCACGGG CGAGATAGAC
GTTGAACTCG GCGCATCGCT GCTGGAGCGA AGCCCCACCA TCGCCGATGA ACAAACGCCA
GCGTTGGCGT TAAATATCAA ATCCTTACCG TTAATTGATA TAGGTAAAAC CACACTCAGA
TTAGCGCCGC AAGCTGAGCT TCCCGCCTAT CAGCTGGTGA TGGATAAACT CAGCCTGAAT
CATCAAGGTG AATTAACTAC AGCCTTTAGC AGCCCTGAGG GTGAGCCGTT AGTCCAGCTT
GCCGCCACGC TAGGCAACGA ACAATGGCGC CTCAAGAGCG AACTGAATAT CGCGCCGCTA
CTGGAAAACC TGCATCAAAT TGGCCTGCGC CAAACCCAAG GGAGCATCCT CAGCCAACTA
ACTCTGTGGG ATCAACAGTG GCAGCAACTT GGGATAGGCT TAAGTGGGCA ACTCAGCTCT
GAGAGCACGA TGACACTCGC CAGCGGCGAA ATCACAAGCC GCCACCGCAT TCAGCAGCCT
AACATCAGCT TAAGCCATTT TGCCGACTTA ACGCTCGCGC CCCAGCCGGC TTTGGGGTTT
GAGCTTAGTG GATCACTTGC CTCGCTCAAT CTCACCCTTG AGCCCTTTAG CCTTGCGCTT
ACGCCGAATG CGGCACAGCA CACGCAGCTA TTAGCGGCGC TTAATCAGTC TCTGCAATTG
AACGATGAAA ACTCTCAAGC GCTCCTTACC CTGCTCTCGG GGCTGAAAAG CACCGAGGCG
CCCGTGGGTC TTGGCTTTTC GATGACAGCG CCGCTGCACT ATGCGCTGAC ATCTCAAGCG
AGCCCCCATA AACCAATAGC GCTGCCCGCA ATAGAGTTAA CCACCCTAGG CTGTAAGCTT
GAGACGCGCA TTCACTTGCA AGAGACCCAG TTAACGCCCA CACCAGATGC TTGGAATGTC
GCGAGCCGCT GGCAACTGGC GCTTAAGCAA ACTTCCCCAC TGACGCTGCG AGACCTCTGG
CACGTAGCCC CGCAGGATCT CAGTTGGGGC GCGGGAATGC TGCAAACGGC GGGTCAGGTC
AGTGTTGCTC AATCGGCTCA AGGACTGAAC TGGCAAGTCA GCACTGCCCC AGTGACGAGT
GACTCAAGTG ACTCAAGTGA CACTCTGCAA TTTGCGCTGG AAGATGTGCA GCTACAACAA
ACAGCGCAAG CCGCCGAGCA CCAAACGAAA CAGACGCAGT TAAGCCTTGG CAGTATTCAA
CTCAACGCCA AGTCCCCCTT GGCCGCAAGT GCAACCCCGT TAGCGACTCA GGATACTACT
GGCGCACAAT CGACCGAGTT TGCCTTCAAG CTGCCGCCTT TATCGCTCGC CTTGTCGCAG
CTGCGCGTGA GCCAAGCGGT AGAGAATCTT GCGGGCAACA ACAGCACCGC GACTGCAATA
CAGAGCAGTC GCAATGATAT TAGCCTCAAG GCGTTTTCCC TTGAGACATC AAAGCCGATG
ACCCTCGATT ACTCGTCGCT GCAATCCATT GTAAATACTA TTCAATCGAG TCAGTTGACT
AACCAAGTTA ACTGGCAAGC GCAGCAACTC TTGATTGAAA AGCAGCTCAG CGCCAAAGGC
CGAACACGTA AGCAGACTGT GCTTAAACTG GATAATTTAA CGCTCGCGCA GACATTAAAC
TGGCAAAACC AACGTCTTCA CGGCCATGAG CAGTGGCAAG TCGGCAAGGT TGCGCTACAA
AGCGATCATC AATTACAGTT CGCCGCGCCT CACAAGCCCC TGCTGTTAAC GGGCCAGTGG
GTTGTCGATA CCAGCATGAC AGAGGCGCTA TCTTTGCTTA ATCAAACCCA GCCCTTGCCC
GCTGAGTTAA ATGTGACCGG TCATAACCAA TTACAGGCAC AATTTAAGCT GACACATGAG
CCAGAGCAGA CCCAATTTGC GATGCAAATT ACCCAGTCGA TGACAGAGCT GGAAGGTTTT
TATCAAGACA CGACCTTTGA GGGCGGGAAA TTACAGGCCC AATGCGAGTT CACCTGGGGG
CAATCCTACA AGTCGCCGCA AGCGCCAGGC TATTTCAGCA GTTTAAGCCG ACTAAACTGC
CCGCAAACCA TGATGACTTT TAACTTGTTC GATCCCGGCT TCCCCTTGAC CGATATCGAA
GTGGAGGCCG ATATTGTCCT CGCCAAGGAT GCTGAGAAGC TTCCCGACAA TTGGCTGCAA
CAACTCACGG GCTTAAGCGA TACCGATGTC TCGATGACCG CCAAGGGTAA AGTATTGAGC
GGCCAGTTTT TACTGCCTGA TTTTAATTTA AAATTGCAGG ACAAATCCCA CGCCTATCTA
TTATTGCAGG GCATGAGCCT TGAAGAAGTG CTGCGGATTC AGCCGCAAAT TGGTATTTAT
GCCGATGGTA TTTTCGATGG TGTGTTACCC GTGGATTTGG TCGATGGCAA AGTCTCGATC
AGCGGCGGCC AACTGGCGGC CCGCGCGCCC GGCGGTCTCA TCGCCATTTC GGGTAATCCT
GCGGTCGATC AAATGCGCCA ATCGCAGCCT TATCTCGATT TTGTATTTTC GGCGCTAGAA
CATTTAGAAT ACAGCCAGTT ATCCAGTAGT TTCGATATGG ATCAAACCGG CGATGCCAAC
TTATTAGTGG AAGTCAAAGG CCGCAGCCGA GGGATTGAAC GCCCCATCCA CTTGAATTAC
TCTCATGAAG AGAACATGCT GCAATTATTC AGGAGTCTTC AGATTGGTAA CGATCTGCAG
GACAGAATCG AAAAATCCGT GAAGTAA
 
Protein sequence
MVNTPTSPNQ ETAEHSHQAM PQEATAAAAP HSAASRRGST LKKRLKQSLI ATTVAGLLAG 
AALTIWIVNS QTEALILRVA NYALSGMDGE LSDIRLGPMG LEHWHIRSAS LRVHDSHLVI
NNLDIQLELN WPKSLEELKQ LVQVESLTQK IKRISTGEID VELGASLLER SPTIADEQTP
ALALNIKSLP LIDIGKTTLR LAPQAELPAY QLVMDKLSLN HQGELTTAFS SPEGEPLVQL
AATLGNEQWR LKSELNIAPL LENLHQIGLR QTQGSILSQL TLWDQQWQQL GIGLSGQLSS
ESTMTLASGE ITSRHRIQQP NISLSHFADL TLAPQPALGF ELSGSLASLN LTLEPFSLAL
TPNAAQHTQL LAALNQSLQL NDENSQALLT LLSGLKSTEA PVGLGFSMTA PLHYALTSQA
SPHKPIALPA IELTTLGCKL ETRIHLQETQ LTPTPDAWNV ASRWQLALKQ TSPLTLRDLW
HVAPQDLSWG AGMLQTAGQV SVAQSAQGLN WQVSTAPVTS DSSDSSDTLQ FALEDVQLQQ
TAQAAEHQTK QTQLSLGSIQ LNAKSPLAAS ATPLATQDTT GAQSTEFAFK LPPLSLALSQ
LRVSQAVENL AGNNSTATAI QSSRNDISLK AFSLETSKPM TLDYSSLQSI VNTIQSSQLT
NQVNWQAQQL LIEKQLSAKG RTRKQTVLKL DNLTLAQTLN WQNQRLHGHE QWQVGKVALQ
SDHQLQFAAP HKPLLLTGQW VVDTSMTEAL SLLNQTQPLP AELNVTGHNQ LQAQFKLTHE
PEQTQFAMQI TQSMTELEGF YQDTTFEGGK LQAQCEFTWG QSYKSPQAPG YFSSLSRLNC
PQTMMTFNLF DPGFPLTDIE VEADIVLAKD AEKLPDNWLQ QLTGLSDTDV SMTAKGKVLS
GQFLLPDFNL KLQDKSHAYL LLQGMSLEEV LRIQPQIGIY ADGIFDGVLP VDLVDGKVSI
SGGQLAARAP GGLIAISGNP AVDQMRQSQP YLDFVFSALE HLEYSQLSSS FDMDQTGDAN
LLVEVKGRSR GIERPIHLNY SHEENMLQLF RSLQIGNDLQ DRIEKSVK
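
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the database's own pipeline: the `translate` helper and the compact codon table are written here for illustration. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The two test strings are the first 30 and the last 27 bases of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence (start of the first line).
print(translate("ATGGTTAACA CGCCGACATC GCCAAATCAG"))  # MVNTPTSPNQ

# Last 27 bases of the gene sequence (the final line, which starts in frame).
print(translate("GACAGAATCG AAAAATCCGT GAAGTAA"))  # DRIEKSVK
```

Both results match the corresponding ends of the protein sequence listed above.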