Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3117 |
Symbol | |
ID | 4244208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4768222 |
End bp | 4771032 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638108130 |
Product | hypothetical protein |
Protein accession | YP_722723 |
Protein GI | 113476662 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATT CCCAACGTCC GACTGTTTTT ATTGAAAAAA TTATGCCTGT GAAATTATTA AATCAACAGG TTTATTATGA ACATGGAGGT AATCCTTTTA AGGGGTTGCA TCGCTGGTAT TCTCGTAAAC CTTTATCGTT TTCAAGAGCT TCGGTTTTAG CTTCTTTATT GCCTGATGAT ATTTCTGTAG AAGAGTTTGA GTATTTATTA GGGTTGAAAA AGCGGAGCAA TTCTATTCAA AAAAGTGAAC AGTATAAGGA TGATACTAAG CTTTATAAAA CTCCTCCTGA TGAAACAAGA ATTAAGCAGG TACATGACTA CTGTGAGAAA ACCTGGGGGA CCAGAACACC TACTATTTTG GATGCGTTCG GTGGTGGTGG GAGTATTCCT TTTGAAGCAG CGCGGTATGG GTTGAATGTT TTGGCATCAG ATTTGAATCC GGTGGCAGTG GTGACAATGA AAGCAGCGAT GGAATATCCT TTGAAATTCG GACCTGACTT GCAGCAGGAT ATTGATAAGT GGGTGCAGTG GGTAGGGGAT GAGGCAGAGA AACGGTTGGC TGAGTTTTTC CCGTCGTTGC CAGGGGAAAG GGTGCAGAAT TATTTGTGGG CCCATACGGT GGTTTGTCCC AGTTGTCAGT CGGTTGTGCC TTTGAGTCCG AATTGGTGGT TATATAAACG CCCGGAAAAG CAGAATTTAC ATAAATGGTG TGCGGTGAAA CCTATTCCTA ATCCTGAAGG GAAGCGGGTT GATTTTGAGT TGATAAAAGG AAGTAAGGGA AAAGGTACGA CTATTAAGAC TGATGAGGGT GAGTTCGATC CGAGTGACTA CAATACTATT AGTAGGGGTG TGGGTAAATG TTTGAATTGT GGTAATGTGA TTGAGGATGG TGTAATTAAA TCTCAGGCAC GGTCAGGAAA GCTTGGGCAT CAAATGTATG CAGTGGCATT TAAAAAAGGT AAGGGAAGTT TAGAGTTTAG ATTACCTCAA AATGTTGACT TTGATGGATT GGGTAAAACT GATTATTATT TGAATAGTAG TTTTGAGGAA TTTCAATTAA GTGGTTTATT ACCAGAAATC GAAATTAACT CTGGTGAGAA AACAGACGAA CTTATTAGAT ATGGAATTAA CCAATGGTCA AAATTATTTA ATCCCCGTCA ACTTCTAACC CTTGTCACTT ATGTCGAAAT TATTAACGAT GTTAAGTTAC AATTACAAGC AGAATATGAA CCCGATAAAG TAGAGGCGAT CGCTACTTAT TTGGCATTGA TATTGGATAG ATGCGTTGAC ATAAACAGCA GACTTACTCA CCTAAACCCA GCAGGAAGTT GGGGAATTCA AATGTCATCT GCCCAACATT CTCTTAATTT AATGTGGAAC TACGTTGAAG CATCTGGTAG TGCAAAGCTT TGGTCAGTAT ATTCTCAAAC TGTTCAAAGC GGATATCCTA AAATTTGCCA ACTCCTCAAC GCCAAACCTC TACCCATTGA CACCCAACAA CACAACAAAA CCATCCAAAT AGACTCCACA TCAGCAGACA CCCTCTACCA CATCCCCAAT AACTCAGTAG ATGCCATCAT CACCGACCCG CCCTACTACG CCACCATTCA ATACGCCGAA CTATCGGACT TTTTCTACGT CTGGCAGCGC CGAGTCTTAG GCGATATCTT CCCCGACCTC TACTTAACCG AACTCACCGA CAAAGACAGA GAAGCAGTTG CCAACCCCTC CCGCTTCCGC AACATGGGAA CATCCCCCGA TGAACTTGCA AACCAAGACT ACGAAGCAAA AATGGCACTC GCCTTTGCCG AACATTACCG AGTCTTGCGC GACGACGGCG TAATGACGGT ACAATTTAAC CACAAAGAGT CCGGCGCGTG GGATGTCTTA GCAAAATCTC TCATTGACGC TGGTTTTGAA ATCACCGCAT CTTGGGCAGT CAGTACTGAA AACCCCCAAA ACCTCCATCA AGCTAAGAAA AATTCTGTTT CCAGCACCGT CTTACTTGTC TGTCGCAAAC GTGACCCCAA CGCCCCTCAA GCATGGTGGG ATGACCTGCA ACCAGAAGTT GCCAACCAAG TAGAGGAACG CGCCCCCGAC TTTGAAAAAA ATGACATCAC GGGAATTGAC CTATATCTCA GCGCATTCGG CCCAGCATTA AACGTATTCA GTCGTTCCTA TCCCATATTA GACAACAGTG GAGTAGAAGT CCGCCCCGAA GTCGCCTTTG CTGAAGCTAG AAAAGCGATC GCTAACTACC GCTTCCAGAA ACTTGTACAA ACAGACACAG CGGGCTTTGA TATTTTGACC CAATGGTATT TATTAGCTTG GGATGCTTTC AGTGCCAGGG AGTTCCCCTT TGATGAAGCC AGACAACTTG CCCTAGCCAT AGGAGGTTTC AACGTCAACG ACCTGGTTAA AGTTCACAAA TTATTAGACT CAACGAGTGG CACTTGCAAA TTATTAACAC CCCGACAACG ACTGAAAAAA CGAGCATTTT CGGTCACTCC ACAAGATTTT TCTAGTCAAT ATTTAGTAGA TGACATTCAT GCTATTATTG CTATTTATCA AGAAGAGGAA AATGTAGAAG TAGTCCGTCG GTTTATGGAA AAAACAGGAT TATTAAGCAA TGAAATGTTT ATGCAAACTA TTGAAGTAGC ATTAAAAGTA ATTCCCGATA AAATAGAAGA GGAACAAACC TTGATGAATT TGTGTTTAAT GATGGATGAA ATTAAAGATA ATGTCAGTAC CCAAGGGAAA CAATTAGAAT TATTTGAACA GCAGTTAAGC TTAGATTTTG GAGATGTTTA A
|
Protein sequence | MNNSQRPTVF IEKIMPVKLL NQQVYYEHGG NPFKGLHRWY SRKPLSFSRA SVLASLLPDD ISVEEFEYLL GLKKRSNSIQ KSEQYKDDTK LYKTPPDETR IKQVHDYCEK TWGTRTPTIL DAFGGGGSIP FEAARYGLNV LASDLNPVAV VTMKAAMEYP LKFGPDLQQD IDKWVQWVGD EAEKRLAEFF PSLPGERVQN YLWAHTVVCP SCQSVVPLSP NWWLYKRPEK QNLHKWCAVK PIPNPEGKRV DFELIKGSKG KGTTIKTDEG EFDPSDYNTI SRGVGKCLNC GNVIEDGVIK SQARSGKLGH QMYAVAFKKG KGSLEFRLPQ NVDFDGLGKT DYYLNSSFEE FQLSGLLPEI EINSGEKTDE LIRYGINQWS KLFNPRQLLT LVTYVEIIND VKLQLQAEYE PDKVEAIATY LALILDRCVD INSRLTHLNP AGSWGIQMSS AQHSLNLMWN YVEASGSAKL WSVYSQTVQS GYPKICQLLN AKPLPIDTQQ HNKTIQIDST SADTLYHIPN NSVDAIITDP PYYATIQYAE LSDFFYVWQR RVLGDIFPDL YLTELTDKDR EAVANPSRFR NMGTSPDELA NQDYEAKMAL AFAEHYRVLR DDGVMTVQFN HKESGAWDVL AKSLIDAGFE ITASWAVSTE NPQNLHQAKK NSVSSTVLLV CRKRDPNAPQ AWWDDLQPEV ANQVEERAPD FEKNDITGID LYLSAFGPAL NVFSRSYPIL DNSGVEVRPE VAFAEARKAI ANYRFQKLVQ TDTAGFDILT QWYLLAWDAF SAREFPFDEA RQLALAIGGF NVNDLVKVHK LLDSTSGTCK LLTPRQRLKK RAFSVTPQDF SSQYLVDDIH AIIAIYQEEE NVEVVRRFME KTGLLSNEMF MQTIEVALKV IPDKIEEEQT LMNLCLMMDE IKDNVSTQGK QLELFEQQLS LDFGDV
|
| |