Gene Information |
Locus tag | Shewana3_2020 |
Symbol | |
ID | 4476385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 2412921 |
End bp | 2416034 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 639726603 |
Product | phage integrase family protein |
Protein accession | YP_869657 |
Protein GI | 117920465 |
COG category | |
COG ID | |
TIGRFAM ID | |
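The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the function names are illustrative, not part of the record):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2412921
END_BP = 2416034


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 3114 bp
print(length_aa, "aa")  # 1037 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.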
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00692434 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000092621 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGTTGAAG TTAAAATCAG TTCTTTCATA AATGATTTAA TTATTAAAGA TGAAAGGCAT CCTTTATATC AAGAGACATT AGTTGCCTAT TTAGCTAAAA ATTGTCATAC CATTTGTGAT GAGTCAAAAG GAGTTGATAC TAAAAAAATA TTTTATTCAC TATATAAAAA CTTGCTAGGA CATTTGAAAA ATCATTTCCT GTTATCAGAG ACTAATTTTA TCCAAGAAGA GTTAGAATCT TTAATTTCTC ATTTAAAGCC AGATGTTATT GGAAAACAAA CCCAGCTGCT AATATTGCAA GCACTGCTTA ATTATGCAAA AAAACAATTT AATTTTGACT CTCCAAATAT ACCTGTCATA GTGACATTAA AACGTGATAA ACCAATACTA TCTCCAGTTG AACTTATTAA ACTTCCTATT GTAGAACAGC TTAATACTAT CATAGATAGT GAACTTAACG TACCCAGTAG GATGCTAACT GTGGATGCTA AACTAGGTCG AACTGCTTTA TTAGTATATT GGACCGTAGG GTTGAGTAAA ACCGAAGAAT TGATTTCGAT TCTTGAATTC CCGCAAAATA TTTTTTATGT TGGTGGTCTT TGTTATTGGC AGTCAGAGAA GAGATATACA TACTCGACTA GGTTCATATT AAGTGATGTT GCAGTAGTTG CCTTGCAACA ATGGAATGTT TTAAATGTAA ATAAATCTTT TAAGGTTATG GATTTAGTAG TAAAATATTT AAATTTTGTA TCTGATTTCG ATTGGTCAGC GTTATCTATT CTAAAATTAA GAACATTACG GAAAATAGAT AATGTCTTGC GCTACGGTCC AGTGCAGTAT CAGATGTATA TTCTACCAAG GGTCAGTCAA GCATTGCCTG AGCATGCGTT TTGTCGTTTG CTAACGGACA AAGCCTGTCG TAATTTAGAA CCTCGTATCT TAGAACCTAG TAGCGATAAT TTAATCCTGT CTAAACCCTG GAAAATGATC AATGCTGGAG ATACCCCCTT CATCGATATC AAAATCATAT TAAAAAATTT GGACAAACTT TTTGAGCGAC TACTGAAATT TAGGTTAGAG AACAATGTTC GAGAAAAATG TATTAAATTT TTGAAAAACA CTATAAAAAA ACCAGAAATC GCCGCCGTAC CTTACTTTTG GCTTTTATGT TCATGGTTAT ACTCTTTGTT AAAGCATGGA GGAACTTCTA AAAGTCGTTT AAAACTATCT ACTATTATCG ACTACGTTAA AAGTCTTAGT AAGCCTTTTC TAACAGTTTT TAGCACCTGT AATATTGGAT TGTTATCTGG TGAAGATTGG GTTAGTAAAT TGAATGATTC GGCTGAATTA TTTTCTTCAG CACAACGTAA AAAATATGTC TATTACTTTG CACAATTTTT GATTGACAAC AGGTTAGTTC GGGATCTATG CCTTTCTGAT ATCGATATAA TAGGCAGTTC AAGTCAAGTT GATGCGAACA TGATATCTCC TACCCACATT GATGAAATTT TGAACTATTT AACTCAACAT TTTAATGATG GACTAGTTTA TCATTATGCG TATTTTTTAT TGTGCTTCTG CTTTTTTAGC GGTTTGCGCA GAAATGAAGC TGCAAAATTA ACGTGGTCGG ATTTTTCATT TTCAATTTTA TTTCCATCTA AGCATGAGTT TGATTATGTT AAGCTATCAG TAAGACCTAA CCAGCATAGA ACGTTGAAAA CATCTTCTGC TCGTAGAGAG ATACCATTAG ATGCATTTTG GCCTAAAATC GCGATAGAAA AACTTAGAAA AAAGTATCAA 
ATTCATCATA AAATGAAACA AAGTAAAAAC GCCCTACTAT TTGATAATCA AAAAGTTGCA AATCAAGCTT ATGATTTAAT AACCGATTTA ATGCGTCATT ATACTCATGA CTATAGCTTA CGCGTTCACC ATCTACGACA TAGTTTTGCC AATTGGTCAT GGTGTCGCTT GAATCCTCAT ATTATCGAGA TAGGAAAAAT TCAGCTTAAG TTATTTAATC ATGAGATCTT TAGTGCAGAA TATTTAAGCC GCTTACAGAA TCGATTATGT TACAGTAACA ATACTCGGAA AAAAATGTTT ATTCTATCCC ATTTGTTAGG GCATAAAGAT GTTTATTCGA CGCTAAATAG TTACCTGCAT CTCAAGGATG TATTGCATTA CTTAGAATTG CAACCTCGAT TTACTTTAAC TAAGTACTTT TGTTCAGAGT GTATAGGTCG ATCTACATTA ATGGAGCAGG AGCAGGAATT AAGTCTGGCA GAGAGAATAC AGTATTATAC TAAAGATACT GAAACTAAGT TGAGCATTAA ACCTGCGCCA TTAATTTCAT CATTAAAAAT TCCGACTTTA GCAAACTTTG TTAAGGAAAT TCACCCTACT ATAAAAGTCA GTAGTTTGAC ATGGGCTAAA GCACTCGCTG CTTTAAAATT CTGTTCGATA GTTGACTCAT CTCATACTTT CAATGTCCCA GTTGAACAAT TACAGCAAAT TCTTACTAAT GCCGAGCAAG TACATAACAA CTACCCTCGA ATAGGAAAAC CTCTCCCGTT AATTCCTAAC TTTCCAAGAC TTATTCATTC ATCAGATTCG AACAACACTA ATAGTTATTT AGCTAAATCA GGTAAAGTAT TTGCTTATCT ATGTAATAAA TTTGATAAAA GTATCGAAGC GGAATCTTTG ACGCTGGCCA ATGTTAGGTT AAGCATGAGT ATTTTGAAAT ATGCAGTGCC GGGGAAGGGT TATGCTTTGC GATGCCCTGA GTCCAACGTT TCTAGGATGT TTGTCCGTTT TTGTCAGTTA CTGGGACTAA AAGACCGCCA TTTAAGGTTT AAATATCATA GAGCAGATCT CACTCTTGAA GCTTCAAATA GAATAGAAGA TAGATGGATA AAAACCATTG TGGATTATGG GTTTAGTGAG AAAAACTTTA CTATTGCGAG TGAAAGTGAA GGTCGATTTT TAGGAAAGCA TGATGGAAAT GGATTTCTTG AGATATTGCT AGTTAACAAT GCGTACAAAC GTGTACAGCG ACATCAAAGT CTGTTTAGTT TCTTACACTT AATATTAATT TTTAGCTACA ATGAAAAGAA ATGA
Protein sequence | MVEVKISSFI NDLIIKDERH PLYQETLVAY LAKNCHTICD ESKGVDTKKI FYSLYKNLLG HLKNHFLLSE TNFIQEELES LISHLKPDVI GKQTQLLILQ ALLNYAKKQF NFDSPNIPVI VTLKRDKPIL SPVELIKLPI VEQLNTIIDS ELNVPSRMLT VDAKLGRTAL LVYWTVGLSK TEELISILEF PQNIFYVGGL CYWQSEKRYT YSTRFILSDV AVVALQQWNV LNVNKSFKVM DLVVKYLNFV SDFDWSALSI LKLRTLRKID NVLRYGPVQY QMYILPRVSQ ALPEHAFCRL LTDKACRNLE PRILEPSSDN LILSKPWKMI NAGDTPFIDI KIILKNLDKL FERLLKFRLE NNVREKCIKF LKNTIKKPEI AAVPYFWLLC SWLYSLLKHG GTSKSRLKLS TIIDYVKSLS KPFLTVFSTC NIGLLSGEDW VSKLNDSAEL FSSAQRKKYV YYFAQFLIDN RLVRDLCLSD IDIIGSSSQV DANMISPTHI DEILNYLTQH FNDGLVYHYA YFLLCFCFFS GLRRNEAAKL TWSDFSFSIL FPSKHEFDYV KLSVRPNQHR TLKTSSARRE IPLDAFWPKI AIEKLRKKYQ IHHKMKQSKN ALLFDNQKVA NQAYDLITDL MRHYTHDYSL RVHHLRHSFA NWSWCRLNPH IIEIGKIQLK LFNHEIFSAE YLSRLQNRLC YSNNTRKKMF ILSHLLGHKD VYSTLNSYLH LKDVLHYLEL QPRFTLTKYF CSECIGRSTL MEQEQELSLA ERIQYYTKDT ETKLSIKPAP LISSLKIPTL ANFVKEIHPT IKVSSLTWAK ALAALKFCSI VDSSHTFNVP VEQLQQILTN AEQVHNNYPR IGKPLPLIPN FPRLIHSSDS NNTNSYLAKS GKVFAYLCNK FDKSIEAESL TLANVRLSMS ILKYAVPGKG YALRCPESNV SRMFVRFCQL LGLKDRHLRF KYHRADLTLE ASNRIEDRWI KTIVDYGFSE KNFTIASESE GRFLGKHDGN GFLEILLVNN AYKRVQRHQS LFSFLHLILI FSYNEKK
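The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the names CODON_TABLE and translate are hypothetical. Only the first 30 and last 24 nucleotides of the gene sequence are used, copied from the record.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 shares these
# assignments with the standard code; only its start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 nucleotides of the gene sequence in the record.
first_30 = "ATGGTTGAAG TTAAAATCAG TTCTTTCATA"
last_24 = "TTTAGCTACA ATGAAAAGAA ATGA"

print(translate(first_30))  # MVEVKISSFI
print(translate(last_24))   # FSYNEKK
```

The two outputs match the first ten and last seven residues of the protein sequence, and the gene sequence ends in the stop codon TGA.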