Gene Information |
Locus tag | Shew_3732 |
Symbol | |
ID | 4921026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | + |
Start bp | 4444955 |
End bp | 4446766 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640165358 |
Product | collagenase |
Protein accession | YP_001095857 |
Protein GI | 127514660 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
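The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 4444955
END_BP = 4446766
GENE_LENGTH_BP = 1812
PROTEIN_LENGTH_AA = 603


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS.

    Assumption: the CDS is unspliced and its last codon is a stop codon,
    so one codon does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length (1812 bp) and protein length (603 aa) agree with the listed coordinates.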
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00527905 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00286894 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGCGTC AATCCCTCTG CGCCCTCTTG CTCGTCGGCC CCAGCTTCTT GGTGGGCTGC AGCGTCACTG CACCAGGACA GATAACCCCA ACCCCAAGAC AGATAGACTC GATCTTTATC AATGAGCTAA GCGCCTTGCC GTCACAGCAG CTCTTCAGCC ATGACAACCC CCTGCTCGAT ACCGCCAGCT TAACTAAGGC ACTACAAGCC ATCATGCAGG CCCAGGATGA GGCGAGCCTA GATGCCCTGC TCTATTACCT AAGGGCCTTT AGCTATTTCG GCCCCATGGA TAAGCTGACC GATACCGAAT ATACCGCTCT GACAGATGCC TTAAGCCGAC TTGGCAATAG TCAGTTGATC GCTCAGGCGC CTAGAATACA GGAGCACTTC GCGGTAACCC TGTATCGTTA CTATGGAATT GATGACAAAG ATGACAAAGT CAATGACCAG CGCGCCCAGC AGCTTGCTAG CTTGCTCCCC CTACTTAATC GGCAACTAAA GAGAACCGCA AAGCAAGCGC CTAGTCAGGC AGCAGATTAT GCCCTATGGG AGACACTCAG AGCCTATGGC ATGCTGCTAA ATATCGCGCG CAAACAGCCC GATCGCGCAC TCAATAAGCA GTTGATCGCC GCCGAACTGG ATAAGCCCCT GCTCGACTTT GCCGCCTCAG CACTAAGCCT GCATGGTCAG CAAGACTGGC CGAGAGCCAA TGCCTATTGG GCACTGGGCC TCTACCGCCT AGCACTCCCC AGCGGCGAAG AGGGCAAGCT CACCCCAGCA GAACAGGCAC TGGACGATGC CATGGTAAGA CTCGCCGAGC AGGATGTCGC CCAACGTGGT GAGGCTGCCA AGACCACCTA TACCTTGGGC TATCATGTGA ACGCCTTCGC CGGCCTGGAG GCCTGCAAGG CCCGCAGCAG TGTCTGTCAG ATCCCAAAGC TGACACAAGT CTTACCCATA CACCATCAAT GTTCAGATAG CCTATATATC CTGGCGCAAG ATCTCACCCC TAATGAGCTG AGTAGCAGCT GCACTAAGCT CACCTCACAG GAGGCCAACT TTCATCGACT GCTCGAGACT CAGCACCAGC CGACCGCCAA CGACAACAAT GATGCGCTGC AGGTAGTGGC CTTCAAGAAT TGGAGCCAAT ACAACGCCTA CGGCCAGCTG CTGTTCGACA TAGGCACGGA TAACGGCGGC ATGTATATCG AAGGCACGCC GTCAAATCCC GAAAATCAGG CCAGCTTCTT CGCCTATCGA CAGTTTTGGA TCGCGCCCGA GTTTGCCATC TGGAACCTGA ATCATGAATA TGTGCACTAT CTCGACGGCC GCTTCGTGAA ATATGGCGGC TTCGGCCATT TCCCGCGCAA GCTGGTGTGG TGGTCGGAAG GGCTGGCCGA GTACGTCTCC AAGGGCAATG ACAATCCATC GACCCTGAAG GTCATCAAGG CTAAACTGGG CGAGGCGCCG GATCTCAAGA CCATCTTCGC CACCGAATAT AAGGATGGCC TCGACAGAAC CTATAAGTGG AGTTACATGG CCATCCGCTT CCTGGCCGAG CACCACCCAG AGGAGTTGGT GCGTCTCAGC CATTACCTGA AGACAGATTA TTTCGAGGGC TATGAGAAGC TGCTCGATCA ACTCTCAGAG CAGCAAAGTG CCTTTTCAAA CTGGCTAACC CAGCAGGTTG CCCAATTTGA GGAGCATGAC ACGGATAAGG CGCCCAAGAT AAACAAGCTT AATCGTTACG CCTATCGCGA TTATCTGATG CCAAGCCACC TCAAGTTAGA TGGGGCCCAT 
CAACATTTTT AA
|
Protein sequence | MLRQSLCALL LVGPSFLVGC SVTAPGQITP TPRQIDSIFI NELSALPSQQ LFSHDNPLLD TASLTKALQA IMQAQDEASL DALLYYLRAF SYFGPMDKLT DTEYTALTDA LSRLGNSQLI AQAPRIQEHF AVTLYRYYGI DDKDDKVNDQ RAQQLASLLP LLNRQLKRTA KQAPSQAADY ALWETLRAYG MLLNIARKQP DRALNKQLIA AELDKPLLDF AASALSLHGQ QDWPRANAYW ALGLYRLALP SGEEGKLTPA EQALDDAMVR LAEQDVAQRG EAAKTTYTLG YHVNAFAGLE ACKARSSVCQ IPKLTQVLPI HHQCSDSLYI LAQDLTPNEL SSSCTKLTSQ EANFHRLLET QHQPTANDNN DALQVVAFKN WSQYNAYGQL LFDIGTDNGG MYIEGTPSNP ENQASFFAYR QFWIAPEFAI WNLNHEYVHY LDGRFVKYGG FGHFPRKLVW WSEGLAEYVS KGNDNPSTLK VIKAKLGEAP DLKTIFATEY KDGLDRTYKW SYMAIRFLAE HHPEELVRLS HYLKTDYFEG YEKLLDQLSE QQSAFSNWLT QQVAQFEEHD TDKAPKINKL NRYAYRDYLM PSHLKLDGAH QHF
|
| |
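The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, with a GC-percentage helper for comparing against the GC content field. The function names `normalize_sequence` and `gc_percent` are hypothetical and are not part of the source database. The short input string is a toy example, not data from this record.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string that is already normalized."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # Toy input in the same block layout as the record (not real data).
    toy = "ATGCGGCCAA TT\nGG"
    seq = normalize_sequence(toy)
    print(seq, len(seq), round(gc_percent(seq), 1))
```

To apply this to the record, paste the full Gene sequence field (including its wrapped final block) into `normalize_sequence` and compare the resulting length and GC percentage with the Gene Length and GC content fields.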