Gene Tery_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4271 
Symbol 
ID4245923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6586958 
End bp6590005 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content36% 
IMG OID638109163 
ProductAAA family ATPase 
Protein accessionYP_723741 
Protein GI113477680 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.365445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA AAGCATGGTA CAAAATAGAT GGTTTAATTC CTAGAGAAGA TTTACGAGAA 
GGTAAACCTT TAGATACTTC TGAATTTGCC ATTCATCTCG ACCAAGTTCG TAACAATAAT
GCTCCGGTAG ATTATCAGCA ACCGGAGCGT TTCTTTGAAC GGACTTATTT AACAAAATCT
CTGACCGACT TAGCATCTCA AGTTGTCCGT CGTTTGTCTG GAGAAACTAG CGGTACTTCA
GCTATTTTTA ACCTCTCTAC TCAATTTGGA GGGGGTAAAA CTCACGCCTT AACTTTACTA
TATCATTTAG CAAAAAATGG CAGTTTAGCT AATAATTGGA CTGGAGTAGA TAAGATTCTC
AAAACAGCAA AAATTAATTC TATTCCCGAA GCAGCAGTAG CAGTTTTTGT AGGAGTACAA
TTTGATTCTA TTACTGGTAG AGGTGGAAAT GATGGTACTC CGCTACGGAA AACTCCCTGG
GGAGAAATTG CTTTTCAATT AGGAGGAGAA ACAGCTTTTA ATTATGTTGC TGAACATGAG
AAAAATTTTA TCGAACCCAA GGGTGATGTG ATTCGGAAAT TATTTCCAAA AGATAGACCC
TGTTTAATTT TAATGGATGA AATTATTAAC TATATTTCTA CTTATCGCAG TCGTGAATAT
CATAACCGCT TTTATAATTT TTTGCAGGCA CTTTCTGAAA CAGCTAGAAG TTTAGAAAAT
GTTGTTTTAG TGGTGTCTAT TCCAGCTTCA GAAATGGAAT ATACCCAAGC AGACGAAGCA
GACGAACAAC GACTTAAAAA AATGTTAGAC CGTCTGGGAA AAGCTATAGT TATGTCAGCA
GAGTCAGAAA CTTCAGAAAT TATCCGTCGT CGTTTGTTTG AGTGGGATAA AGAAGCGGTT
ACTACCGAAG GTAAAATTAT GTTGCCCAGA GATGCGATCG CCACCTGTAA TGAATACGCT
GAATGGGTGA TATCCCACCA CCAGCAAATA CCTAGTTGGT TTAACGTTGA TAACGCCCAA
AAAGCATTCA TCGCTACCTA CCCATTTCAC CCCACAGTTT TGTCAGTATT TGAGCGTAAG
TGGCAAGTTT TACCAAGGTT TCAACGAACT AGGGGTATTT TGAGACTATT AGCACTCTGG
GTTGCTCGTG CCTATCAAGA AAGTTACAGA AAAGTTCACA ACGACCCACT CATTGGTTTG
GGTAATGCTC CCCTAGACGA CCCTCTGTTT CGCACCGCAG TATTTAATCA GTTGGGCGAA
GAGCGTTTAG AGGGTGCTGT CACCACAGAT ATTTGTGGTA AGCAGAATGC TCATGCTATC
CGGTTAGATA AGGAAGCAGT AGCAACCATT AAAAAATCTC GTCTCCATCG AAAAGTCGCA
ACTACTATAT TTTTTGAATC TAATGGCGGT CAACAAAACA CCGAAGCAAC TATTCCTGAA
ATTAGGTTAG CAGTGGGAGA GCCAAATTTT GATATTGTCA ATGTAGAAAC AGTTTTGGAA
GCATTGCAAA ATGAAAGTTA TTATTTATTG GTAAATAAGA ATAGATATCG GTTTAATATT
TCTCCCAACT TGAATAAAAT TTTAGCAGAC CGTCGAGCCA ATATACAAAG TTCTAGAATA
CAGGAAAGAG TCGCAGCCGA AATTAAAAAA GTTTTTACTA CTAATGGGGT AATTCAAGTT
GTTTATTTTC CAGAATATAC TAATAGTATT GCCAACCGAC CAGTGCTGAA TTTAGCTATT
TTAAGTCCCG AATATGTTCG TTCTGATGAG GAGACAAGCA AATTTATAGA ATCAATAATA
TTAAATTGTG GTAGTTCAGC TAGAACTTTT AAGAGTGCGA TAATTTTTGT GGTGGCTAAT
CCTGATAATC TCCTGAGAGA ACGAGCCAGA AATTTATTAG CTTGGGAGGA TATTAGCGAC
CAAGAAACAG AATTAAATGC CGAGCAAAAA CAGCAACTTA AGGAAAATAT TGACAAGTCG
AAAAAGTTAT TACAAGAAAC TGTTTGGCAG TCTTATACAT CTGTTGCTTT GTGGGGAAAA
AATCCAGAGA TTCCAGAGAT TCAAGAGATT CAGTGGATTG ATTTAGGAAT GCTTAATTCG
AGTCAAGCTA ACTCAATGGT AAGTTTAATT ATCAATCGTC TACAGGCTAA TGGAGAAATT
ACAAGCGTCA TTCATCCTCG GTTTTTACTC AGAAATTGGC CGCCTGTTTT TACAGAATGG
AGTATTAAAA ATATTCAGGA TGCCTTTTTT GCTTCCCCAA CTTTTCCTAG ATTATTAGAA
GCTAAAGCTG TGCCAGAAGC TATTGTTCAT GGAGTACGGG AAGGATTATT AGCTTATGTG
GGGAAAAGTA AAAATGGTTA TGAACCTTTT ATATTTAATC AACCCAACTT TTCTGTTCAA
GATGTAGAAA TATCTGATGA TGTATTTGTA ATTAAAGCAG AAGTAGCAGA AGATTATGAA
GCTAGAATTA AAGACCCCCC CAGATTAACT CAATTAATTA TTTCTCCTGA AGAAGTTCTC
TTAAAACCTG GTCAAAAACA AACATTTTTA GTGACAGGAA AAGACCAATA TAACCGGTCA
ATTAATGTAG AAGAAATTAA CTGGAGTGCT ACAGGAGGAG AAATTAACTT CAATGGTGTT
TTAACAGCCG GGAAAGATGA GGGGAATTTT ATAGTTACTG CTCAAGTAGA ATATATTAGC
ATTAAAGCCA AATTTACTGT AGAGCTTAAG TTACCTCATC TCCAAGAAAA ATCGGAAAAT
TATAATATTA CCGATAGTAG TTTAACCAAG TCAGATAGAA CTAATGAAAT TACTGAAGGT
GAAACATCAA ATATTCCTAA TATAATTTCT TGGCGAGGAG AAATTACACC TCAAAAGTGG
ATGCAATTTT ATACAAAAGT TCTGAGTAGG TTTGCTGCAA AAAAAGATTT TAATTTAACA
GTAGAAATTA ATTTTTCTGT AACCGGAAGC ATTACATCTC AAAATATTAA TGAAACCAAA
GTTTCTCTCC AGGAATTAGG ACTTGATGAT GGTATTGAAA CTAGCTAA
 
Protein sequence
MAIKAWYKID GLIPREDLRE GKPLDTSEFA IHLDQVRNNN APVDYQQPER FFERTYLTKS 
LTDLASQVVR RLSGETSGTS AIFNLSTQFG GGKTHALTLL YHLAKNGSLA NNWTGVDKIL
KTAKINSIPE AAVAVFVGVQ FDSITGRGGN DGTPLRKTPW GEIAFQLGGE TAFNYVAEHE
KNFIEPKGDV IRKLFPKDRP CLILMDEIIN YISTYRSREY HNRFYNFLQA LSETARSLEN
VVLVVSIPAS EMEYTQADEA DEQRLKKMLD RLGKAIVMSA ESETSEIIRR RLFEWDKEAV
TTEGKIMLPR DAIATCNEYA EWVISHHQQI PSWFNVDNAQ KAFIATYPFH PTVLSVFERK
WQVLPRFQRT RGILRLLALW VARAYQESYR KVHNDPLIGL GNAPLDDPLF RTAVFNQLGE
ERLEGAVTTD ICGKQNAHAI RLDKEAVATI KKSRLHRKVA TTIFFESNGG QQNTEATIPE
IRLAVGEPNF DIVNVETVLE ALQNESYYLL VNKNRYRFNI SPNLNKILAD RRANIQSSRI
QERVAAEIKK VFTTNGVIQV VYFPEYTNSI ANRPVLNLAI LSPEYVRSDE ETSKFIESII
LNCGSSARTF KSAIIFVVAN PDNLLRERAR NLLAWEDISD QETELNAEQK QQLKENIDKS
KKLLQETVWQ SYTSVALWGK NPEIPEIQEI QWIDLGMLNS SQANSMVSLI INRLQANGEI
TSVIHPRFLL RNWPPVFTEW SIKNIQDAFF ASPTFPRLLE AKAVPEAIVH GVREGLLAYV
GKSKNGYEPF IFNQPNFSVQ DVEISDDVFV IKAEVAEDYE ARIKDPPRLT QLIISPEEVL
LKPGQKQTFL VTGKDQYNRS INVEEINWSA TGGEINFNGV LTAGKDEGNF IVTAQVEYIS
IKAKFTVELK LPHLQEKSEN YNITDSSLTK SDRTNEITEG ETSNIPNIIS WRGEITPQKW
MQFYTKVLSR FAAKKDFNLT VEINFSVTGS ITSQNINETK VSLQELGLDD GIETS