Gene Shewana3_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_4109 
Symbol 
ID4480323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4931315 
End bp4934083 
Gene Length2769 bp 
Protein Length922 aa 
Translation table11 
GC content49% 
IMG OID639728724 
ProductDNA polymerase I 
Protein accessionYP_871732 
Protein GI117922540 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000411128 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.224079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACCA TAGCCAATAA CCCACTTGTC CTTGTGGATG GATCTTCTTA TTTATATCGC 
GCCTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCTACTGG TGCTGTTTAT
GGCGTAGTGA ATATGCTCCG CAGCTTATTA AGCCGTTATC AACCTAGCCA TATCGCTGTG
GTGTTCGATG CTAAAGGCAA AACCTTCCGC AATGACTTAT ATGAAGAATA CAAGGCACAT
CGCCCGCCTA TGCCGGATGA CCTGCGCTCA CAAATTGAGC CACTACACCG TATTATCCGT
GCCTTAGGCC TGCCCTTAAT CTCTATTCCT GGTGTTGAGG CGGACGATGT TATCGGCACA
ATCGCTCGCC AAGCGAGCCG CGAAAACCGC GCTGTACTCA TCAGCACTGG TGATAAAGAC
ATGGCGCAGC TGGTTGATGA AAATATCACG CTGATCAACA CCATGACAGA TACCATTATG
GGCCCTGAAG AAGTTGCGGC TAAATATGGT GTAGGTCCAG ACAGAATTAT CGATTTCTTA
GCGCTGATGG GCGATAAGGC GGATAACATT CCCGGTTTAC CTGGTGTTGG CGAAAAAACC
GCATTAGCTA TGCTCACGGG GGCGGGTAGT GTCGCCAATT TGCTTGCAGA GCCCGAAAAA
GTAACCGAAT TAGGCTTTAG GGGCGCAAAA ACCATGGCGG CGAAAATCAT CGACAATGCC
GACATGCTAA AGCTGTCCTA TGAGCTTGCC ACCATTAAAA CCGATGTTGA ACTCGAACAA
GATTGGCATG AGCTCACCGC CAAACCCGCT GACAGGGACG AACTGATCAA ATGCTACGGC
GAGATGGAGT TTAAACGCTG GCTTGCCGAA GTCTTAGATA ATAAGGCGCC AGCGACGGTC
GCAGCAAAAG CCGAAACAAC AGAGACCCAA GAAGAGTCAG CGCCCAGCGT CACGATTGAA
ACCCAATACG ATACAATTCT GACCGAAGCT CAGCTTGATG AGTGGATTGC CAAACTCAAA
CAAGCGCCAT TAATGGCCGT AGATACCGAG ACCACCAGCC TCGACTATAT GGTTGCGGAA
TTGGTTGGCC TGTCCTTTGC TGTTGAAGCG GGTAAAGCCG CCTATCTGCC CTTAGCCCAC
GATTATGTTG GCGCACCTCA ACAATTAGAT AAGCAGACTG CACTCGAAAA ACTGCGCCCC
TTACTCGAAG ATGCCAAGAT TAAAAAAGTC GGTCAAAATC TGAAATATGA CATCAGCGTA
TTAGCCAATG CAGGCATAAA ACTCCAAGGC GTGGTATTCG ACACTATGCT CGAATCCTAT
GTGTTTAACT CGATCGCCTC ACGCCATGAT ATGGATGGGT TGGCGCTAAA ATATCTGGGC
CATAAAAATA TCGCCTTTGA AGATATCGCA GGTAAAGGTG CTAAACAGCT GACCTTCAAC
CAAATTCCGT TGGAAACAGC TGCGCCCTAT GCGGCGGAAG ATGCCGATAT TACCCTACGT
CTACATCAAC ATTTGTGGCC AAGACTCGAA AAAGAGACCG AATTAGCCTC GGTCTTTACC
GATATTGAAC TGCCGCTGAT CCAAATACTG TCCGATATTG AACGCCAAGG TGTGTTTATC
GATAGTATGT TGCTCGGCCA ACAGAGTGAT GAACTTGCCC GCAAAATCGA TGAGTTAGAA
ACAAAAGCTT ATGATATTGC AGGTGAAAAA TTCAATTTAA GCTCACCAAA GCAACTACAA
GTGCTGTTTT TTGAAAAGCT GGGTTATCCG GTCATCAAAA AAACCCCTAA GGGCGCCCCC
TCTACCGCGG AAGAAGTACT GGTTGAGTTG GCATTGGATT TCCCTCTGCC TAAAGTGATC
CTTGAACATA GAAGCCTAAC CAAGCTAAAG AGTACTTACA CCGACAAGCT CCCTCTAATG
GTGAACGCGA AAACGGGTCG GGTACACACA AGCTACCATC AGGCCAACGC CGCAACGGGG
CGTTTGTCCT CGAGCGAACC AAACCTACAG AATATTCCTA TCCGCACCGA GGAAGGTCGT
CGTATTCGCC AAGCCTTTAT TGCGCCGCAG GGACGTAAGA TTTTGGCCGC CGACTATTCG
CAGATTGAAT TACGCATCAT GGCGCATTTA TCCCAAGATG CGGGCTTACT TAAAGCCTTC
GCCGAAGGTA AAGACATTCA CAGAGCCACC GCCGCCGAAG TATTTGGCAC CGACTTTGAC
AGTGTCACCT CGGAGCAGCG TCGCCGCGCC AAAGCCGTTA ACTTTGGCCT TATCTATGGC
ATGTCCGCCT TTGGATTGGC GCGTCAGCTC GATATTCCCC GCAACGAGGC ACAAACTTAC
ATCGACACTT ACTTCGCCCG CTATCCAGGC GTATTAAGGT ATATGGAAGA AACACGAGCC
AGTGCAGCAG AACTTGGCTA TGTCTCTACG CTATTTGGGC GCCGTCTCTA TTTACCTGAA
ATTCGCGATC GTAATGCAAT GCGCCGCCAA GCAGCAGAAA GAGCCGCGAT TAACGCCCCA
ATGCAAGGCA CCGCCGCGGA TATTATTAAA AAAGCCATGA TCAGCATTGC CGATTGGATA
AAAACCGATA CCCAAGGTGA AATCGCCATG ATCATGCAAG TCCACGACGA ATTAGTATTC
GAAGTCGATG CCGATAAAGC CGAAACACTC AAGCTCAAGG TGTGTGAACT CATGGCAAAA
GCAGCCAATC TGGACGTGGA ACTTCTGGCA GAAGCTGGTA TTGGCGATAA CTGGGATCAA
GCCCACTAG
 
Protein sequence
MPTIANNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYQPSHIAV 
VFDAKGKTFR NDLYEEYKAH RPPMPDDLRS QIEPLHRIIR ALGLPLISIP GVEADDVIGT
IARQASRENR AVLISTGDKD MAQLVDENIT LINTMTDTIM GPEEVAAKYG VGPDRIIDFL
ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VANLLAEPEK VTELGFRGAK TMAAKIIDNA
DMLKLSYELA TIKTDVELEQ DWHELTAKPA DRDELIKCYG EMEFKRWLAE VLDNKAPATV
AAKAETTETQ EESAPSVTIE TQYDTILTEA QLDEWIAKLK QAPLMAVDTE TTSLDYMVAE
LVGLSFAVEA GKAAYLPLAH DYVGAPQQLD KQTALEKLRP LLEDAKIKKV GQNLKYDISV
LANAGIKLQG VVFDTMLESY VFNSIASRHD MDGLALKYLG HKNIAFEDIA GKGAKQLTFN
QIPLETAAPY AAEDADITLR LHQHLWPRLE KETELASVFT DIELPLIQIL SDIERQGVFI
DSMLLGQQSD ELARKIDELE TKAYDIAGEK FNLSSPKQLQ VLFFEKLGYP VIKKTPKGAP
STAEEVLVEL ALDFPLPKVI LEHRSLTKLK STYTDKLPLM VNAKTGRVHT SYHQANAATG
RLSSSEPNLQ NIPIRTEEGR RIRQAFIAPQ GRKILAADYS QIELRIMAHL SQDAGLLKAF
AEGKDIHRAT AAEVFGTDFD SVTSEQRRRA KAVNFGLIYG MSAFGLARQL DIPRNEAQTY
IDTYFARYPG VLRYMEETRA SAAELGYVST LFGRRLYLPE IRDRNAMRRQ AAERAAINAP
MQGTAADIIK KAMISIADWI KTDTQGEIAM IMQVHDELVF EVDADKAETL KLKVCELMAK
AANLDVELLA EAGIGDNWDQ AH