Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4271 |
Symbol | |
ID | 4245923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6586958 |
End bp | 6590005 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638109163 |
Product | AAA family ATPase |
Protein accession | YP_723741 |
Protein GI | 113477680 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.365445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTA AAGCATGGTA CAAAATAGAT GGTTTAATTC CTAGAGAAGA TTTACGAGAA GGTAAACCTT TAGATACTTC TGAATTTGCC ATTCATCTCG ACCAAGTTCG TAACAATAAT GCTCCGGTAG ATTATCAGCA ACCGGAGCGT TTCTTTGAAC GGACTTATTT AACAAAATCT CTGACCGACT TAGCATCTCA AGTTGTCCGT CGTTTGTCTG GAGAAACTAG CGGTACTTCA GCTATTTTTA ACCTCTCTAC TCAATTTGGA GGGGGTAAAA CTCACGCCTT AACTTTACTA TATCATTTAG CAAAAAATGG CAGTTTAGCT AATAATTGGA CTGGAGTAGA TAAGATTCTC AAAACAGCAA AAATTAATTC TATTCCCGAA GCAGCAGTAG CAGTTTTTGT AGGAGTACAA TTTGATTCTA TTACTGGTAG AGGTGGAAAT GATGGTACTC CGCTACGGAA AACTCCCTGG GGAGAAATTG CTTTTCAATT AGGAGGAGAA ACAGCTTTTA ATTATGTTGC TGAACATGAG AAAAATTTTA TCGAACCCAA GGGTGATGTG ATTCGGAAAT TATTTCCAAA AGATAGACCC TGTTTAATTT TAATGGATGA AATTATTAAC TATATTTCTA CTTATCGCAG TCGTGAATAT CATAACCGCT TTTATAATTT TTTGCAGGCA CTTTCTGAAA CAGCTAGAAG TTTAGAAAAT GTTGTTTTAG TGGTGTCTAT TCCAGCTTCA GAAATGGAAT ATACCCAAGC AGACGAAGCA GACGAACAAC GACTTAAAAA AATGTTAGAC CGTCTGGGAA AAGCTATAGT TATGTCAGCA GAGTCAGAAA CTTCAGAAAT TATCCGTCGT CGTTTGTTTG AGTGGGATAA AGAAGCGGTT ACTACCGAAG GTAAAATTAT GTTGCCCAGA GATGCGATCG CCACCTGTAA TGAATACGCT GAATGGGTGA TATCCCACCA CCAGCAAATA CCTAGTTGGT TTAACGTTGA TAACGCCCAA AAAGCATTCA TCGCTACCTA CCCATTTCAC CCCACAGTTT TGTCAGTATT TGAGCGTAAG TGGCAAGTTT TACCAAGGTT TCAACGAACT AGGGGTATTT TGAGACTATT AGCACTCTGG GTTGCTCGTG CCTATCAAGA AAGTTACAGA AAAGTTCACA ACGACCCACT CATTGGTTTG GGTAATGCTC CCCTAGACGA CCCTCTGTTT CGCACCGCAG TATTTAATCA GTTGGGCGAA GAGCGTTTAG AGGGTGCTGT CACCACAGAT ATTTGTGGTA AGCAGAATGC TCATGCTATC CGGTTAGATA AGGAAGCAGT AGCAACCATT AAAAAATCTC GTCTCCATCG AAAAGTCGCA ACTACTATAT TTTTTGAATC TAATGGCGGT CAACAAAACA CCGAAGCAAC TATTCCTGAA ATTAGGTTAG CAGTGGGAGA GCCAAATTTT GATATTGTCA ATGTAGAAAC AGTTTTGGAA GCATTGCAAA ATGAAAGTTA TTATTTATTG GTAAATAAGA ATAGATATCG GTTTAATATT TCTCCCAACT TGAATAAAAT TTTAGCAGAC CGTCGAGCCA ATATACAAAG TTCTAGAATA CAGGAAAGAG TCGCAGCCGA AATTAAAAAA GTTTTTACTA CTAATGGGGT AATTCAAGTT GTTTATTTTC CAGAATATAC TAATAGTATT GCCAACCGAC CAGTGCTGAA TTTAGCTATT TTAAGTCCCG AATATGTTCG TTCTGATGAG GAGACAAGCA AATTTATAGA ATCAATAATA TTAAATTGTG GTAGTTCAGC TAGAACTTTT AAGAGTGCGA TAATTTTTGT GGTGGCTAAT CCTGATAATC TCCTGAGAGA ACGAGCCAGA AATTTATTAG CTTGGGAGGA TATTAGCGAC CAAGAAACAG AATTAAATGC CGAGCAAAAA CAGCAACTTA AGGAAAATAT TGACAAGTCG AAAAAGTTAT TACAAGAAAC TGTTTGGCAG TCTTATACAT CTGTTGCTTT GTGGGGAAAA AATCCAGAGA TTCCAGAGAT TCAAGAGATT CAGTGGATTG ATTTAGGAAT GCTTAATTCG AGTCAAGCTA ACTCAATGGT AAGTTTAATT ATCAATCGTC TACAGGCTAA TGGAGAAATT ACAAGCGTCA TTCATCCTCG GTTTTTACTC AGAAATTGGC CGCCTGTTTT TACAGAATGG AGTATTAAAA ATATTCAGGA TGCCTTTTTT GCTTCCCCAA CTTTTCCTAG ATTATTAGAA GCTAAAGCTG TGCCAGAAGC TATTGTTCAT GGAGTACGGG AAGGATTATT AGCTTATGTG GGGAAAAGTA AAAATGGTTA TGAACCTTTT ATATTTAATC AACCCAACTT TTCTGTTCAA GATGTAGAAA TATCTGATGA TGTATTTGTA ATTAAAGCAG AAGTAGCAGA AGATTATGAA GCTAGAATTA AAGACCCCCC CAGATTAACT CAATTAATTA TTTCTCCTGA AGAAGTTCTC TTAAAACCTG GTCAAAAACA AACATTTTTA GTGACAGGAA AAGACCAATA TAACCGGTCA ATTAATGTAG AAGAAATTAA CTGGAGTGCT ACAGGAGGAG AAATTAACTT CAATGGTGTT TTAACAGCCG GGAAAGATGA GGGGAATTTT ATAGTTACTG CTCAAGTAGA ATATATTAGC ATTAAAGCCA AATTTACTGT AGAGCTTAAG TTACCTCATC TCCAAGAAAA ATCGGAAAAT TATAATATTA CCGATAGTAG TTTAACCAAG TCAGATAGAA CTAATGAAAT TACTGAAGGT GAAACATCAA ATATTCCTAA TATAATTTCT TGGCGAGGAG AAATTACACC TCAAAAGTGG ATGCAATTTT ATACAAAAGT TCTGAGTAGG TTTGCTGCAA AAAAAGATTT TAATTTAACA GTAGAAATTA ATTTTTCTGT AACCGGAAGC ATTACATCTC AAAATATTAA TGAAACCAAA GTTTCTCTCC AGGAATTAGG ACTTGATGAT GGTATTGAAA CTAGCTAA
|
Protein sequence | MAIKAWYKID GLIPREDLRE GKPLDTSEFA IHLDQVRNNN APVDYQQPER FFERTYLTKS LTDLASQVVR RLSGETSGTS AIFNLSTQFG GGKTHALTLL YHLAKNGSLA NNWTGVDKIL KTAKINSIPE AAVAVFVGVQ FDSITGRGGN DGTPLRKTPW GEIAFQLGGE TAFNYVAEHE KNFIEPKGDV IRKLFPKDRP CLILMDEIIN YISTYRSREY HNRFYNFLQA LSETARSLEN VVLVVSIPAS EMEYTQADEA DEQRLKKMLD RLGKAIVMSA ESETSEIIRR RLFEWDKEAV TTEGKIMLPR DAIATCNEYA EWVISHHQQI PSWFNVDNAQ KAFIATYPFH PTVLSVFERK WQVLPRFQRT RGILRLLALW VARAYQESYR KVHNDPLIGL GNAPLDDPLF RTAVFNQLGE ERLEGAVTTD ICGKQNAHAI RLDKEAVATI KKSRLHRKVA TTIFFESNGG QQNTEATIPE IRLAVGEPNF DIVNVETVLE ALQNESYYLL VNKNRYRFNI SPNLNKILAD RRANIQSSRI QERVAAEIKK VFTTNGVIQV VYFPEYTNSI ANRPVLNLAI LSPEYVRSDE ETSKFIESII LNCGSSARTF KSAIIFVVAN PDNLLRERAR NLLAWEDISD QETELNAEQK QQLKENIDKS KKLLQETVWQ SYTSVALWGK NPEIPEIQEI QWIDLGMLNS SQANSMVSLI INRLQANGEI TSVIHPRFLL RNWPPVFTEW SIKNIQDAFF ASPTFPRLLE AKAVPEAIVH GVREGLLAYV GKSKNGYEPF IFNQPNFSVQ DVEISDDVFV IKAEVAEDYE ARIKDPPRLT QLIISPEEVL LKPGQKQTFL VTGKDQYNRS INVEEINWSA TGGEINFNGV LTAGKDEGNF IVTAQVEYIS IKAKFTVELK LPHLQEKSEN YNITDSSLTK SDRTNEITEG ETSNIPNIIS WRGEITPQKW MQFYTKVLSR FAAKKDFNLT VEINFSVTGS ITSQNINETK VSLQELGLDD GIETS
|
| |