Gene Shewana3_1920 details


Gene Information

Locus tag: Shewana3_1920
Symbol:
ID: 4478228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2282229
End bp: 2283893
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 50%
IMG OID: 639726502
Product: hypothetical protein
Protein accession: YP_869557
Protein GI: 117920365
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
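
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = feature_length(2282229, 2283893)
    print(length)                           # 1665
    print(protein_length_from_cds(length))  # 554
```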


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.976156
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGTA ACATGATAAA TAATGAAAAT ATCCGAGCGA AAGAGTCTAA GGTTAGACAG 
CTTTCGCCCG TGTGGTTGAT CCCTATGATC GCCGTCCTGA TTGGCTGCTG GATGTTGTAT
AGCTATTTCA GCCAATTAGG GACCGAAATC CAGCTGCATT TAAAAACCGC CGAGGGCATA
GAAGTCGGTA AAACCTTTTT AAAATCCCGC AATGTGAATG TAGGGGTGAT AGAAAGCATT
AAATTAAGCG ATGACTACAG CGCCATAGTG GCGACGGCGC GGATCTCAAA CGATGCCAAA
CGTATGCTAA AGCAAGATGC ACGCTTTTGG GTGGTGAAAC CGCGGATTGG TATGGAAGGG
GTCTCGGGAT TAGATACCTT GCTCTCAGGC GCTTATATCG AACTAGAGCC GGGGAAATCA
TCGACGCCCC AGTATGAATT TACTGTGCTG GACAATCCGC CAGTGGCATC GGCCGACGAG
GAGGGGATGC GGATCACCCT GACTAGCCCG CAGGCCGGTA AGTTAAGTGT GGGCGATCCT
GTGCTGTATG AAGGTTTTAG AGTCGGTCGA GTGGAAACCT TAGGCTTTAA CACCGAGCGG
CGCGAAGCCT TTTACCAGCT GTTTATCAAT AAACCCTACG ATGAACTCGT TCGGGATAAC
AGCCAGTTTT GGTTGACCTC TGGCATTAAC ATGCAACTCT CGGCCAAAGG GTTAAATCTG
CAGGTGGGGT CGCTCGAAAC CTTGCTCTCT GGCGGTGTGA GTTTCTGTTT GCCTGAAGGG
CGTCTCGCTG GGGCAAAAAT CACAGAGGAT GGCCATGATT TTAGGTTGTA TGACTCGCAG
GAGCTTGCGA GCCAAAGCGT ATACGACAAG TACTTAGAAT TCGTGATGTT ATTCGATGAG
TCTATTCGTG GATTACATGA CGGTGCGACG GTGGAGTTCC GTGGCATCAC CATAGGCGAA
GTGGTGAAAT CACCGCTCAC TTTGCAGCAA CTCGATCCGC ATTTTGGTCG CTTTAGCCAT
GGCACGATTC CCGTATTGGT CAAAATCGAA CTAGCGCGGG TGTTTGAGCA TGCCGAACAG
GTAGGCTTGG ACAATTTACG CGCTGAAATC GAGCGAGAGT TGCACTCGGG TCTGCGGGCG
AGCCTGAAAA CCGGCAATTT ACTCACGGGT GCCTTGTTTA TCGATCTGGA TTTATACGCC
GATGCTAAGC CCTATCAAAC CGCCGATTTT ATGGGCTATC CGGTATTCCC GACGCAGCGC
GCGGGTGTGG CTGAGATCCA AAAGCAAGTT GGGCAGCTTA TCAGCAAGCT CAATAATTTA
CCGCTGGAAA AAACCTTCTC AGAGGTCAAT ACCACGCTAC AAAATACCGC CAGTGCATTG
GCGCAGTGGG ATAAGGTGGG CGCGAGTTTA GACCAAGTGT TACAGCAGCA AGAAATGATG
TCGCTGCCCG CAGAAATACA ACAAACTTTG AAAGCCGTAA GCTTGACGGC GAAGGGATAT
GGGCCAGAGT CGAGCGTTTA TGCCGAACTT CAGACCAGTC TGCAGCAACT GCAAACCTTA
ATGAAAGAGT TAGCGCCCTT GTCGCGTCAG CTGAATCAAA AGCCTAACGC ACTTATTTTA
GGGGCAGATC TGCCCGCAGA CCCCATCCCA GTTAAAGGTA ACTAA
 
Protein sequence
MSSNMINNEN IRAKESKVRQ LSPVWLIPMI AVLIGCWMLY SYFSQLGTEI QLHLKTAEGI 
EVGKTFLKSR NVNVGVIESI KLSDDYSAIV ATARISNDAK RMLKQDARFW VVKPRIGMEG
VSGLDTLLSG AYIELEPGKS STPQYEFTVL DNPPVASADE EGMRITLTSP QAGKLSVGDP
VLYEGFRVGR VETLGFNTER REAFYQLFIN KPYDELVRDN SQFWLTSGIN MQLSAKGLNL
QVGSLETLLS GGVSFCLPEG RLAGAKITED GHDFRLYDSQ ELASQSVYDK YLEFVMLFDE
SIRGLHDGAT VEFRGITIGE VVKSPLTLQQ LDPHFGRFSH GTIPVLVKIE LARVFEHAEQ
VGLDNLRAEI ERELHSGLRA SLKTGNLLTG ALFIDLDLYA DAKPYQTADF MGYPVFPTQR
AGVAEIQKQV GQLISKLNNL PLEKTFSEVN TTLQNTASAL AQWDKVGASL DQVLQQQEMM
SLPAEIQQTL KAVSLTAKGY GPESSVYAEL QTSLQQLQTL MKELAPLSRQ LNQKPNALIL
GADLPADPIP VKGN
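
As a quick sanity check on the two sequences above, the sketch below strips the 10-residue block spacing, computes GC content, and translates short stretches of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names are hypothetical, and only the first 30 and last 15 bases of the gene are used for brevity.

```python
BASES = "TCAG"
# Amino acids listed in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 up to the first stop codon.

    A trailing partial codon is ignored; characters other than A, C, G, T
    raise KeyError.
    """
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    gene_start = clean_sequence("ATGAGTAGTA ACATGATAAA TAATGAAAAT")
    gene_end = clean_sequence("GTTAAAGGTA ACTAA")
    print(translate(gene_start))  # MSSNMINNEN
    print(translate(gene_end))    # VKGN
```

The two translated fragments agree with the first ten and last four residues of the listed protein sequence. Running `gc_content` on the full cleaned gene sequence would give the value to compare against the GC content field.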