Gene Information |
Locus tag | Tery_4431 |
Symbol | |
ID | 4246084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6829501 |
End bp | 6831891 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638109314 |
Product | FHA domain-containing protein |
Protein accession | YP_723891 |
Protein GI | 113477830 |
COG category | [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component |
TIGRFAM ID | |
|
|
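The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not appear in the protein; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6829501
end_bp = 6831891

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2391
print(protein_length)  # 796
```

Both results agree with the listed Gene Length (2391 bp) and Protein Length (796 aa).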
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATA AAATAGGTCA AACAACCCTG TTAAGTAATA ATCCTTATAT AGAATTAAAT AATCAAGGCC AAATTATCAC CTTCCAACTG ACACAAAAGC GCCATATATT AGGACGCGAT CGCCGCCGCG CCAATCTAGT AGTGCCCTCA AATTGGTTGC CCATCTCTAA TTATCATGCC CTCATCAAGA AATCTGACGA TCGCTACCAT ATTTATGATG GTGACGGTTA TCGACCAAGT ACAAACGGAT TATTTCTCAA CCAGACCCGC ATTACTCCTT CAGAAGGTCA TCCTTTGGAA AATGGCATAG AAATCAAAAT CGGCCTAGAA CCCCAAAATC AAATCTTGCT CAAATACTTT GAACCGAATA ATTCTACAGC TTTAACATCT TTACCAAAAA CGTCCTCAAT ATCTTTGCAA AATAGATCAG TATTACTTGG TCGGGATCCT GACGCTACCC TAGAGTTGAA TGCTCCTATT ATATCCCGTC ATCATGCCAC CATTGAGCCC AATGCTCAAG GTGAGTACGT TTTACAGGAC TATAGTAGCA ACGGTGTGTT TGTTAATGGG GTCCGAGTTC AAAGTACTAT TGTTGTGACG GAGGGAGCAG TAATTAAAAT CGGACCTTTT ACCCTTATTC GTCGTGGGGA TAACTTGGAA GTCCTCGACC CCGGTAACGA AATTCGTCTT GATGCCTATC GCCTCTACAG AATAGTTAGA GACCAACGGG GGAAAACACA AGTTTTACTA AATGATATCT CCCTTGCGAT CGAACCTGGG CAATTTGTTG CTTTAGTCGG AGGAAGTGGA ACTGGTAAGT CGACTTTGAT GCGAACATTA TTAGGAATAG ACCCGACAAC TAAAGGCCAA GTTTATATTA ATGGTGAAGA TTTAAGGAAT AATTTTAATA TTTATCGGAC TCAGATAGGT TATGTTCCTC AAGATGATAT TATTCATCCC GAATTAACAG TATTGGAAGC TTTAACTTAT ACTGCAAAAT TACGCTTACC TCCAGATACT AATGTTCAAG AAGTAGTGGA AAAAACTATC GAACAAATAG AGATGGCAGA GAGAAAAAAT ATATTAGTTA GTCAGTTAAG TGGAGGGCAA AGAAAAAGGG TAAGTATTGG CTTAGAATTA TTAGCAGATC CCAAATTATT TTTCTTAGAT GAACCGACAT CTGGACTCGA CCCAGGATTA GATAAAAAAA TGATGCAACT ATTAAAAAAG TTGGCAAATC AAGGACGAAC AATTATTTTA GTTACTCATG CCACAGCTAA TATTAGAATT TGCGATCGCA TAGTTTTTTT AGGTAGAAGT GGGCGTTTAT GTTACTTTGG TTCTCCGAAA GAAGCTATGA ATTTTTTCTC AATCAATACT GGTGACTTTG CTGATATTTA CAATGAACTA GAAACCAGTG ATGAAAATAT TAATGAATGG GTCAATAATT TTCGTCAATC AGAATATTAT CGGCGCTATA TTTCCAATCA TTTGAGTGTT AATAACTTAA AACCTCCTAC TAACTTACCC CCCAAAAAAC AACTCCCTTC TTTTGCTAAA CAACTATTTA TTTTAATTCA ACGATATTTA AAATTAATTT TCCGTGACCC GATTAATTTA GGTTTATCTT TATTAACTGC TCCTATTGGT ATTGGTTTAA TTTTATTTGC AGTTAGAGAT AAAAATCCAT TAATTGGCGA CCCAGAACCT ACTTTAGCCC CTTTAGCTTT GCGAGTATTA TTTGTATTTA CTTGTGCCTG TTTATGGGTT GGTCTTTCGA GTTCTCTGCA AGAAATAGTC 
AAAGAATCAG CTATTTATAT TCGAGAAAGA TTGGTAAATT TAGGATTATT TGCTTATCTA GCTTCAAAAG TCATAGTTTT AGGATTATTA GCAATTTTTC AAACTTTATT AATAGCAGGA GTAATTATCT TGGGATTTGA AAGCCCCCAG CCAGAAATAA TTTCATGGTC AATGGGAGTT AGTATTACTA CTTTTTTAAC TTTAATAAGT TGTATTAGTA TGGGATTAAT GATTTCTTCT ATTGTTAAAA ACGCTTCTCA AGCTAATAGT GCTTTACCCT TAATATTGCT ACCTCAGATT ATTTTTTCTG GAGTTTTATT TGAGATGAAA GGTATAGCTA GTAAATTTTC TTGGGTAATG TTAAGTCGTT GGTCTGTAGC TGCTTATGGG GCTCTAGTCA ATGTCAATAA AATGGTTCCA GAAGCAACTA AATTACCAGA TGGTAGTAGG GTTTCTTTAC CATTTTCTGG TTCAGATATT TATAACCTTA ACTGGGGAAA TTTAGGTTTA AGTTGGGGAG TTTTATGTTT ACATTCACTG ATTTATTTAG GTTTAACCAT TTGGTTTAAA AAACAGAAAG ATATTATATA A
|
Protein sequence | MSNKIGQTTL LSNNPYIELN NQGQIITFQL TQKRHILGRD RRRANLVVPS NWLPISNYHA LIKKSDDRYH IYDGDGYRPS TNGLFLNQTR ITPSEGHPLE NGIEIKIGLE PQNQILLKYF EPNNSTALTS LPKTSSISLQ NRSVLLGRDP DATLELNAPI ISRHHATIEP NAQGEYVLQD YSSNGVFVNG VRVQSTIVVT EGAVIKIGPF TLIRRGDNLE VLDPGNEIRL DAYRLYRIVR DQRGKTQVLL NDISLAIEPG QFVALVGGSG TGKSTLMRTL LGIDPTTKGQ VYINGEDLRN NFNIYRTQIG YVPQDDIIHP ELTVLEALTY TAKLRLPPDT NVQEVVEKTI EQIEMAERKN ILVSQLSGGQ RKRVSIGLEL LADPKLFFLD EPTSGLDPGL DKKMMQLLKK LANQGRTIIL VTHATANIRI CDRIVFLGRS GRLCYFGSPK EAMNFFSINT GDFADIYNEL ETSDENINEW VNNFRQSEYY RRYISNHLSV NNLKPPTNLP PKKQLPSFAK QLFILIQRYL KLIFRDPINL GLSLLTAPIG IGLILFAVRD KNPLIGDPEP TLAPLALRVL FVFTCACLWV GLSSSLQEIV KESAIYIRER LVNLGLFAYL ASKVIVLGLL AIFQTLLIAG VIILGFESPQ PEIISWSMGV SITTFLTLIS CISMGLMISS IVKNASQANS ALPLILLPQI IFSGVLFEMK GIASKFSWVM LSRWSVAAYG ALVNVNKMVP EATKLPDGSR VSLPFSGSDI YNLNWGNLGL SWGVLCLHSL IYLGLTIWFK KQKDII
|
| |
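The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the amino-acid assignments of NCBI translation table 11 (the same as the standard code) and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 30 and last 21 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 21 bases of the gene sequence, as printed in 10-base blocks.
first_30 = "ATGAGTAATA AAATAGGTCA AACAACCCTG"
last_21 = "AAACAGAAAG ATATTATATA A"

print(translate(first_30))  # MSNKIGQTTL
print(translate(last_21))   # KQKDII*
```

The first fragment reproduces the opening residues of the protein sequence, and the second reproduces its final six residues followed by the stop codon (`*`), which is why the protein is one residue shorter than the codon count.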