Gene Shewana3_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2020 
Symbol 
ID4476385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2412921 
End bp2416034 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content33% 
IMG OID639726603 
Productphage integrase family protein 
Protein accessionYP_869657 
Protein GI117920465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00692434 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000092621 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTTGAAG TTAAAATCAG TTCTTTCATA AATGATTTAA TTATTAAAGA TGAAAGGCAT 
CCTTTATATC AAGAGACATT AGTTGCCTAT TTAGCTAAAA ATTGTCATAC CATTTGTGAT
GAGTCAAAAG GAGTTGATAC TAAAAAAATA TTTTATTCAC TATATAAAAA CTTGCTAGGA
CATTTGAAAA ATCATTTCCT GTTATCAGAG ACTAATTTTA TCCAAGAAGA GTTAGAATCT
TTAATTTCTC ATTTAAAGCC AGATGTTATT GGAAAACAAA CCCAGCTGCT AATATTGCAA
GCACTGCTTA ATTATGCAAA AAAACAATTT AATTTTGACT CTCCAAATAT ACCTGTCATA
GTGACATTAA AACGTGATAA ACCAATACTA TCTCCAGTTG AACTTATTAA ACTTCCTATT
GTAGAACAGC TTAATACTAT CATAGATAGT GAACTTAACG TACCCAGTAG GATGCTAACT
GTGGATGCTA AACTAGGTCG AACTGCTTTA TTAGTATATT GGACCGTAGG GTTGAGTAAA
ACCGAAGAAT TGATTTCGAT TCTTGAATTC CCGCAAAATA TTTTTTATGT TGGTGGTCTT
TGTTATTGGC AGTCAGAGAA GAGATATACA TACTCGACTA GGTTCATATT AAGTGATGTT
GCAGTAGTTG CCTTGCAACA ATGGAATGTT TTAAATGTAA ATAAATCTTT TAAGGTTATG
GATTTAGTAG TAAAATATTT AAATTTTGTA TCTGATTTCG ATTGGTCAGC GTTATCTATT
CTAAAATTAA GAACATTACG GAAAATAGAT AATGTCTTGC GCTACGGTCC AGTGCAGTAT
CAGATGTATA TTCTACCAAG GGTCAGTCAA GCATTGCCTG AGCATGCGTT TTGTCGTTTG
CTAACGGACA AAGCCTGTCG TAATTTAGAA CCTCGTATCT TAGAACCTAG TAGCGATAAT
TTAATCCTGT CTAAACCCTG GAAAATGATC AATGCTGGAG ATACCCCCTT CATCGATATC
AAAATCATAT TAAAAAATTT GGACAAACTT TTTGAGCGAC TACTGAAATT TAGGTTAGAG
AACAATGTTC GAGAAAAATG TATTAAATTT TTGAAAAACA CTATAAAAAA ACCAGAAATC
GCCGCCGTAC CTTACTTTTG GCTTTTATGT TCATGGTTAT ACTCTTTGTT AAAGCATGGA
GGAACTTCTA AAAGTCGTTT AAAACTATCT ACTATTATCG ACTACGTTAA AAGTCTTAGT
AAGCCTTTTC TAACAGTTTT TAGCACCTGT AATATTGGAT TGTTATCTGG TGAAGATTGG
GTTAGTAAAT TGAATGATTC GGCTGAATTA TTTTCTTCAG CACAACGTAA AAAATATGTC
TATTACTTTG CACAATTTTT GATTGACAAC AGGTTAGTTC GGGATCTATG CCTTTCTGAT
ATCGATATAA TAGGCAGTTC AAGTCAAGTT GATGCGAACA TGATATCTCC TACCCACATT
GATGAAATTT TGAACTATTT AACTCAACAT TTTAATGATG GACTAGTTTA TCATTATGCG
TATTTTTTAT TGTGCTTCTG CTTTTTTAGC GGTTTGCGCA GAAATGAAGC TGCAAAATTA
ACGTGGTCGG ATTTTTCATT TTCAATTTTA TTTCCATCTA AGCATGAGTT TGATTATGTT
AAGCTATCAG TAAGACCTAA CCAGCATAGA ACGTTGAAAA CATCTTCTGC TCGTAGAGAG
ATACCATTAG ATGCATTTTG GCCTAAAATC GCGATAGAAA AACTTAGAAA AAAGTATCAA
ATTCATCATA AAATGAAACA AAGTAAAAAC GCCCTACTAT TTGATAATCA AAAAGTTGCA
AATCAAGCTT ATGATTTAAT AACCGATTTA ATGCGTCATT ATACTCATGA CTATAGCTTA
CGCGTTCACC ATCTACGACA TAGTTTTGCC AATTGGTCAT GGTGTCGCTT GAATCCTCAT
ATTATCGAGA TAGGAAAAAT TCAGCTTAAG TTATTTAATC ATGAGATCTT TAGTGCAGAA
TATTTAAGCC GCTTACAGAA TCGATTATGT TACAGTAACA ATACTCGGAA AAAAATGTTT
ATTCTATCCC ATTTGTTAGG GCATAAAGAT GTTTATTCGA CGCTAAATAG TTACCTGCAT
CTCAAGGATG TATTGCATTA CTTAGAATTG CAACCTCGAT TTACTTTAAC TAAGTACTTT
TGTTCAGAGT GTATAGGTCG ATCTACATTA ATGGAGCAGG AGCAGGAATT AAGTCTGGCA
GAGAGAATAC AGTATTATAC TAAAGATACT GAAACTAAGT TGAGCATTAA ACCTGCGCCA
TTAATTTCAT CATTAAAAAT TCCGACTTTA GCAAACTTTG TTAAGGAAAT TCACCCTACT
ATAAAAGTCA GTAGTTTGAC ATGGGCTAAA GCACTCGCTG CTTTAAAATT CTGTTCGATA
GTTGACTCAT CTCATACTTT CAATGTCCCA GTTGAACAAT TACAGCAAAT TCTTACTAAT
GCCGAGCAAG TACATAACAA CTACCCTCGA ATAGGAAAAC CTCTCCCGTT AATTCCTAAC
TTTCCAAGAC TTATTCATTC ATCAGATTCG AACAACACTA ATAGTTATTT AGCTAAATCA
GGTAAAGTAT TTGCTTATCT ATGTAATAAA TTTGATAAAA GTATCGAAGC GGAATCTTTG
ACGCTGGCCA ATGTTAGGTT AAGCATGAGT ATTTTGAAAT ATGCAGTGCC GGGGAAGGGT
TATGCTTTGC GATGCCCTGA GTCCAACGTT TCTAGGATGT TTGTCCGTTT TTGTCAGTTA
CTGGGACTAA AAGACCGCCA TTTAAGGTTT AAATATCATA GAGCAGATCT CACTCTTGAA
GCTTCAAATA GAATAGAAGA TAGATGGATA AAAACCATTG TGGATTATGG GTTTAGTGAG
AAAAACTTTA CTATTGCGAG TGAAAGTGAA GGTCGATTTT TAGGAAAGCA TGATGGAAAT
GGATTTCTTG AGATATTGCT AGTTAACAAT GCGTACAAAC GTGTACAGCG ACATCAAAGT
CTGTTTAGTT TCTTACACTT AATATTAATT TTTAGCTACA ATGAAAAGAA ATGA
 
Protein sequence
MVEVKISSFI NDLIIKDERH PLYQETLVAY LAKNCHTICD ESKGVDTKKI FYSLYKNLLG 
HLKNHFLLSE TNFIQEELES LISHLKPDVI GKQTQLLILQ ALLNYAKKQF NFDSPNIPVI
VTLKRDKPIL SPVELIKLPI VEQLNTIIDS ELNVPSRMLT VDAKLGRTAL LVYWTVGLSK
TEELISILEF PQNIFYVGGL CYWQSEKRYT YSTRFILSDV AVVALQQWNV LNVNKSFKVM
DLVVKYLNFV SDFDWSALSI LKLRTLRKID NVLRYGPVQY QMYILPRVSQ ALPEHAFCRL
LTDKACRNLE PRILEPSSDN LILSKPWKMI NAGDTPFIDI KIILKNLDKL FERLLKFRLE
NNVREKCIKF LKNTIKKPEI AAVPYFWLLC SWLYSLLKHG GTSKSRLKLS TIIDYVKSLS
KPFLTVFSTC NIGLLSGEDW VSKLNDSAEL FSSAQRKKYV YYFAQFLIDN RLVRDLCLSD
IDIIGSSSQV DANMISPTHI DEILNYLTQH FNDGLVYHYA YFLLCFCFFS GLRRNEAAKL
TWSDFSFSIL FPSKHEFDYV KLSVRPNQHR TLKTSSARRE IPLDAFWPKI AIEKLRKKYQ
IHHKMKQSKN ALLFDNQKVA NQAYDLITDL MRHYTHDYSL RVHHLRHSFA NWSWCRLNPH
IIEIGKIQLK LFNHEIFSAE YLSRLQNRLC YSNNTRKKMF ILSHLLGHKD VYSTLNSYLH
LKDVLHYLEL QPRFTLTKYF CSECIGRSTL MEQEQELSLA ERIQYYTKDT ETKLSIKPAP
LISSLKIPTL ANFVKEIHPT IKVSSLTWAK ALAALKFCSI VDSSHTFNVP VEQLQQILTN
AEQVHNNYPR IGKPLPLIPN FPRLIHSSDS NNTNSYLAKS GKVFAYLCNK FDKSIEAESL
TLANVRLSMS ILKYAVPGKG YALRCPESNV SRMFVRFCQL LGLKDRHLRF KYHRADLTLE
ASNRIEDRWI KTIVDYGFSE KNFTIASESE GRFLGKHDGN GFLEILLVNN AYKRVQRHQS
LFSFLHLILI FSYNEKK