Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4471 |
Symbol | |
ID | 4246124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6897392 |
End bp | 6900367 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638109354 |
Product | hypothetical protein |
Protein accession | YP_723931 |
Protein GI | 113477870 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.756132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0586303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCCG ATATTACTGA AAAAGGTCTC GAAAATATTA TCTATCAAAG TCTCATCGAC GACTGCCAAT ATTTAGAAGG TAACCCCAAA GACTACGACC AAACCTACTG TATCGACACT GAGAAACTTT TCCAGTTTCT CCAAAACACC CAACCTGAAA AATTAACAGA AATTTCTAAC TACCACGGCG CCAACTGGGA GAAAAAACTT TATGAACGCC TCCACCACCA AATAGAAGAG AAAAGTATAG TCAATATATT ACGTCAAGGT ATCAAAACTG GAGAAACCCA CCTCGAACTT TACTATAAAC TTCCCACTTC CCAACTCAAC CCCGACACTA TCGAAAATTT CCAAGAAAAC GTTTTTTCAG TCACTCGCCA ACTAAAATAC AAAGAAAACC GTAACTTCTC CCTCGACTTA GTAATTTTTA TAAACGGTTT ACCAGTTATT ACCTTCGAGC TAAAAAACCA ACTAACCAAA CAAAACTTTC GAGACGCCAT AAACCAATAT AAAAATGACC GACGCCCCAG AGAATTATTA TTTCAATTCA AACGTTGCCT AGTACATTTT GCCCTCGACG CCGATGAAGT TTGGATGACA ACAAAACTCA ACGGCAAAAA TACAGAATTC ATACCATTCA ACAAAGGCAA AAAATCAAAT CCTGATCTAC CTTTTCCGGA TACAGCAGGG AACCCTCCCA ACCCCAACCA CATCAAAACA GATTATTTGT GGAAAGAAAT TTTAACCATA GAAAGTCTCG GAAACATCAT CGAACATTAC GCCCAACTGA TAGAAAAAGA AGAAGATAAA GACAAAGACA AAAAAACAGT CAAAAAGCTA AAACTAATCT TCCCCCGCTA CCATCAACTC GACCTAGTCA AGCAACTTTT AACAAGTGCA AAAAAACATG GAGTCGGCAA CCGCTACTTA ATCCAACATT CTGCAGGTTC CGGCAAAAGT AATTCTATAA CCTGGCTGAG TCATCAACTC GTAGAACTGA AAAATATTAC CGAGAAAGAA AATATTTTTG ATTCAGTTTT AGTCGTGACA GACCGCAAAA TTTTAGATAA ACAAATTAGG GAAAATATTC AACAATTCGC TCAAGAAGAC AAAGTCGTAG AAGCAACCAA AAACAGCAAA AAATTAAAAT CAGCCTTGGA AAACAAACGG AAAATTATTA TTACAACAGT GCAAAAATTT CCATATGTTG TCAAAGAAAT TCAATCTTTA TCCGATCACA AGTTTGCCAT TATTATCGAC GAAGCACATT CGAGTCAAAC TGGCAAAAGT GCAGCCAGCA TGAGTGAATC TTTGAGCAAA AAAGATTCGG AAGTAGAAGA AACCACAGAG GATAAAATAA TCCGAATTAT TGAGTCACAA AAACTTTGCC CAAATGCTAA TTATTATGCA TTTACTGCCA CGCCAAAGAA TAAAACTTTA GAGTTATTTG GTGTCAAAAA TCCAGAAGAT GGAAAATTTT ATCCGTTCCA TAGTTATTCC ATGAAGCAGG CAATTGAAGA AGGATTTATT CTGAATGTTT TGCAGCATTA TACGACCTAC AAAACCTATT GTCGATTAGA GAAGAAAGTT ATAGACGACC CTGAATTTGA TAGTAAACAA GCAAAAAAGA AGTTAAAACA ATATGTAGAA GAGGATCAAG AGAGTATCCG CAAAAAGTCA GAAGTGATGA TTGAGCATTT TTTATCAAAG GTAATTGCTC AGGGAAAAAT TAATGGAAAA GCTAAGGCTA TGGTCGTTAG TAATAGTATT AAAAGTGCGA TTTATTATAA AAAAGCTTTT GATAAATATT TGAGAGAAAA AAAATCTGAT TATCAGACTA TTGTTGCTTT TTCTGGAAGT AAAGAAATAG ACGGCAAAAA GGAAAATGAG TCTTCTATGA ATGGATTTTC TAGTAGTAAG ATTACAGAAA AATTTAATGA TAGTAAATAT AGGTTTTTAA TTGTGGCTAA TAAGTATCAA ACTGGTTTTG ATGAACCGTT GTTACATACT ATGTATGTGG ATAAAGTTTT ATCTGATGTG AAAGCAGTAC AAACTTTGTC TAGGTTAAAC CGTTCTTGTG AGGGAAAAAC AGATACTTTT GTTTTAGATT TTGTTAATTC TGCTGATGAA ATTCAGAGAG CTTTTGAACC TTATTATAAA ACAACTATTT TGAGTGAAGA AACAGATAGC GATCGCCTCT ATGATTTAGA GGATAGTTTA GCAAGTTTTC AGATTTATTC TCAAGAAAAT GTAGAGAAAT TTATGAAGCT TTTTTTGAAT TGTGAGTCAC GGGAAAATTG GGAGTCAATT TTAGATATTT GTGTGGAAAA ATATAATTGT GATTTGCTAG AGGAGGAAAA AATAGAGTTT AAAAGTAAAG CCAGGAGTTT TGTGAAAAAT TATCAATTTT TGGTGCAAGT AAAAAGTTTT AAAAATTCCA ATTGGGAGAG TTTAAATAGT TTTCTGAAAT TGTTAGTTAA TAAACTGCCA CAATTAGATA ATTCTGATTT ATCGGCAGGA ATTATTAATA GTGTGGATAT TGAGAGTTAT CGAGTAGAGC TTCTAGCTAG TCAAAGTATT AATTTAAGTG GAGAAAATAC CCTATCTCCC ATTGCCAAGA ATATTGTTAG TGGAAATTCT CAAAGTAGGT CAGATAAAGT TAGTCAAATA ATCGAAGAAT TTAATAACCG CTTCGGTGGT AATATTGTTT GGCAAAATGA GGGTAGGGCA TGGAAATTTT TATTAGAGGA GTTGCCAGAA AAAGTCAGAG GAAATGGGGA GTATAAAAAT GCTATAAATT ATAGCGATCC GCAAAATGCC AAACTTACCT TTGAAAATAA ATTCAATCAA GAATTACGGC GTTCTACCCG TGAACATATA GAAGAATATC GTCAATTTAC AGGTAATAAA AGTTTTCGAG AATGGTTAAT TAATACTTTA TTTAATCTTG ACTACGAGCA AGATAAAAAT GCTTAG
|
Protein sequence | MASDITEKGL ENIIYQSLID DCQYLEGNPK DYDQTYCIDT EKLFQFLQNT QPEKLTEISN YHGANWEKKL YERLHHQIEE KSIVNILRQG IKTGETHLEL YYKLPTSQLN PDTIENFQEN VFSVTRQLKY KENRNFSLDL VIFINGLPVI TFELKNQLTK QNFRDAINQY KNDRRPRELL FQFKRCLVHF ALDADEVWMT TKLNGKNTEF IPFNKGKKSN PDLPFPDTAG NPPNPNHIKT DYLWKEILTI ESLGNIIEHY AQLIEKEEDK DKDKKTVKKL KLIFPRYHQL DLVKQLLTSA KKHGVGNRYL IQHSAGSGKS NSITWLSHQL VELKNITEKE NIFDSVLVVT DRKILDKQIR ENIQQFAQED KVVEATKNSK KLKSALENKR KIIITTVQKF PYVVKEIQSL SDHKFAIIID EAHSSQTGKS AASMSESLSK KDSEVEETTE DKIIRIIESQ KLCPNANYYA FTATPKNKTL ELFGVKNPED GKFYPFHSYS MKQAIEEGFI LNVLQHYTTY KTYCRLEKKV IDDPEFDSKQ AKKKLKQYVE EDQESIRKKS EVMIEHFLSK VIAQGKINGK AKAMVVSNSI KSAIYYKKAF DKYLREKKSD YQTIVAFSGS KEIDGKKENE SSMNGFSSSK ITEKFNDSKY RFLIVANKYQ TGFDEPLLHT MYVDKVLSDV KAVQTLSRLN RSCEGKTDTF VLDFVNSADE IQRAFEPYYK TTILSEETDS DRLYDLEDSL ASFQIYSQEN VEKFMKLFLN CESRENWESI LDICVEKYNC DLLEEEKIEF KSKARSFVKN YQFLVQVKSF KNSNWESLNS FLKLLVNKLP QLDNSDLSAG IINSVDIESY RVELLASQSI NLSGENTLSP IAKNIVSGNS QSRSDKVSQI IEEFNNRFGG NIVWQNEGRA WKFLLEELPE KVRGNGEYKN AINYSDPQNA KLTFENKFNQ ELRRSTREHI EEYRQFTGNK SFREWLINTL FNLDYEQDKN A
|
| |