Gene Tery_1047 details


Gene Information

Locus tag: Tery_1047
Symbol:
ID: 4242010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1636808
End bp: 1638328
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 40%
IMG OID: 638106280
Product: AAA ATPase, central region
Protein accession: YP_720892
Protein GI: 113474831
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
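
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 1636808
end_bp = 1638328
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons.
# Assumption: the final codon is the stop codon and encodes no residue.
codons, remainder = divmod(span, 3)
coded_residues = codons - 1

print(span, remainder, coded_residues)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.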


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.696597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0936811
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAG AACTCAATAT TCTAATCGAA GCTCAATATC CTTTGATCTA CCTCGTAACC 
TCCGAGGAAG AGCGGTCAGA GCAGGCAATT TTAGCATTAG CTCAGAGAAA ACTACAGCGC
AAAGTATTTG TTTGGACAGT GACTCACGGT ATTACAGACT ATGATCAGAG CAAAAATACG
ACTCAGCACA ATACAGTCTC GCCCGAGTCA GCTATAGAGT GGGTAATTAG GCAGCGAGAT
CCTAACACTG GCGCTGGAAT ATATGTATTC AAGGATTTAC ATCCTTTTAT CGATTCACCA
CCAGTTACTA GGTGGTTAAG AGATGCGATA GCTAGTTTTA AAGGTACAAA AAAGACAATT
ATATTAATGT CTCCTGTGCA AAATGTACCC ATAGAATTAG AAAAGGAAGT AGTTGTCCTT
GACTTTCCAT TGCCAGATAT GAAAGAATTA AATCAAGTTC TCTCAGGACA ATTAGATTCT
GCTAAAAACC GACGTATTTC TACAGAAACA AGAGAAAAAC TACTAAAAGC AGCTCTGGGT
TTGACAAAAG ATGAAGCCGA AAAAGTATAT CGTAAAGCTC AAGTAACAGC AGGACGCCTA
ACTGAAAAGG AAGTTGACAT TGTACTTTCT GAGAAAAAAC AGCTCATCAG GCGCAACGGT
ATACTAGAAT ACATCGAAAA GGACGAAACT ATAAATGCTG TAGGTGGTCT AGAGGAGTTG
AAACATTGGT TAAGGCAACG TTCTGATGCC TTTACAGAGC GTGCCCGAGA ATATGGACTA
CCTCAACCAA AGGGAATGTT GATTCTAGGA ATACCTGGAT GTGGCAAGTC TCTGATAGCA
AAAACTACAT CTGGTCTATG GGGTCTACCT TTATTGCGAT TAGATATGGG ACGTGTATAC
GATGGTTCAA TGGTAGGACG CTCAGAGGCT AACTTGCGAA ATGCTCTCAG AACAGCTGAA
TCAATTTCAC CTGCTATTTT ATTTATAGAT GAGTTAGATA AAGCCTTTGC AGGTAGTACA
GGTTCAGCTG ATTCTGATGG AGGTACTTCT AGTCGGATAT TTGGCTCATT CCTAACTTGG
ATGCAGGAAA AAACTTCTCC AGTGTTTGTT ATGGCAACTG CCAACCGGGT AGAACGTCTA
CCAGGAGAGT TTTTGAGAAA AGGTAGGTTT GATGAAATTT TCTTTGTAGA CTTACCAAAC
AAAGAAGAAC GCCAAGATAT TTTCCAAATT CACCTAATAA AAAGACGTCG AGATATTGAA
CGCTTCGATC TGGATCAACT ATCTAATGTA TCCGATGGCT TTTCAGGTGC AGAAATAGAG
CAAGCCATAA TTGCTGCTAT GTATGAAGCA TTTGCTCAAG ATAGAGAATT TACACAGCTA
GATATTATTG CCGCAATTAA ATCTACACTA CCGTTATCGA AGACCATGAC AGAGCAAGTT
ACTGCTCTAA GAGATTGGGC TAGACAACGT GCGCGGCCTG CTGCATCTTC AGTTGCCGAG
TATCAAAGAC TGGAGTTCTA A
 
Protein sequence
MKEELNILIE AQYPLIYLVT SEEERSEQAI LALAQRKLQR KVFVWTVTHG ITDYDQSKNT 
TQHNTVSPES AIEWVIRQRD PNTGAGIYVF KDLHPFIDSP PVTRWLRDAI ASFKGTKKTI
ILMSPVQNVP IELEKEVVVL DFPLPDMKEL NQVLSGQLDS AKNRRISTET REKLLKAALG
LTKDEAEKVY RKAQVTAGRL TEKEVDIVLS EKKQLIRRNG ILEYIEKDET INAVGGLEEL
KHWLRQRSDA FTERAREYGL PQPKGMLILG IPGCGKSLIA KTTSGLWGLP LLRLDMGRVY
DGSMVGRSEA NLRNALRTAE SISPAILFID ELDKAFAGST GSADSDGGTS SRIFGSFLTW
MQEKTSPVFV MATANRVERL PGEFLRKGRF DEIFFVDLPN KEERQDIFQI HLIKRRRDIE
RFDLDQLSNV SDGFSGAEIE QAIIAAMYEA FAQDREFTQL DIIAAIKSTL PLSKTMTEQV
TALRDWARQR ARPAASSVAE YQRLEF
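
The gene and protein sequences above are displayed in 10-letter groups, so they need to be ungrouped before any analysis. The sketch below strips the grouping and translates the first and last lines of the gene sequence for comparison with the protein sequence. It uses only the Python standard library. The helper names (`ungroup`, `translate`, `gc_fraction`) are illustrative, not part of any database tool. The codon mapping is the standard one, which translation table 11 shares; alternative start codons are not handled.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Translation table 11 shares this mapping.
# Assumption: alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(text):
    """Remove the spaces and line breaks used to display 10-letter groups."""
    return "".join(text.split())


def translate(dna):
    """Translate complete codons in frame; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = ungroup(
    "ATGAAAGAAG AACTCAATAT TCTAATCGAA GCTCAATATC CTTTGATCTA CCTCGTAACC"
)
last_line = ungroup("TATCAAAGAC TGGAGTTCTA A")

print(translate(first_line))
print(translate(last_line))
```

The first line translates to MKEELNILIEAQYPLIYLVT, the first 20 residues of the protein sequence, and the last line translates to YQRLEF followed by a stop codon, matching the end of the protein. Passing the full ungrouped gene sequence to `gc_fraction` would give a value to compare against the listed GC content.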