Gene Tery_2327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2327 
Symbol 
ID4245209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3601653 
End bp3603533 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content36% 
IMG OID638107422 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_722022 
Protein GI113475961 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.761686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000043688 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTATTCCT TTGATAATCT GAACTTAGTA ATGTTTAGTG GCAAAGGAGG CGTGGGAAAA 
ACTACAAATT CTTGTGCTTT TGCTGGCCAT TGGGCAAAAA AATCTCCTAA TGAAAAGGTC
TTGTTAATTT CTACTGATCC TGCACACTCT TTAGGAGATG TATTGCAAAT AAGTGTTACA
GATACACCAA GACCTTTAGA AGATTTACCT AATCTTTTAG TCAGAGCTTT AGATGTTAAT
CTTCTGCTTG AAGAATTTAA AAAACGTTAC GGCGATGTCC TAGAAGTAAT TGTAGAGCGA
GGTAGTTTTG TTGAGGGGGG AGACCTGACT CCAGTATGGG ATTTAGATTG GCCAGGATTA
GATGAATTAA TGGCTTTACT GGAAATACAA AGGCTTTGTA ACGAAAAAGT AGTTGATCGC
GTAGTAGTGG ACATGGCTCC GAGCGGTCAT ACTCTAAACT TATTCAAACT AATGGATTTT
TTGGATACTT TTCTCAATTC TCTAGAGCTT TTTCAAGAGA AACATAAATA TATAAAGAAA
AGCTTCTCAG GTTCTTATAT ACCAAATGAA GTTGATGAAG TATTACAAAA TCTTAAGGAT
GAATTAGCAG CAGGTCGTCA TTTATTACAA AATTCATCTA ATACGGCTTG TTTAGTTGTA
GCACTACCTG AACCGATGAG TTTTCGAGAA ACTCAGAGGT TTTTAAGTTC TTTGGAAGAG
ATCAAAATTC CTTACGCTGG TATTGTTGTC AATCAAATTG TTGTAGATAA AGATGGTAAT
AGCGATCGCT ACCACGAACA GCAAAAGTTA GTTAATGACT TTATTAAACT CGCGGGGGAT
AAACCAGTAT TTTTGGTACC CCAAGAAAAA GCTGAACCTT TAGGTGTAAC TGCTTTAGAA
AAGCTAACAA ACCAAATTAA TCAAGCAACA ATTCAAGAAT TCTCAACTTC TATAAGCTTC
AATATTCAAT GGCCAGAAAA GGTTCCTCCT GGTTTTACCG ACTTTATCGG AAAGGGTAAA
CGTTTATTAA TTGTTGGAGG AAAAGGAGGT GTAGGAAAGA CTACGATAGC TGCTGCTATT
AGTTGGGAAA TGGCAAAACG ATATCCCGAA AGACAAGTTA GGGCTGTTTC TATTGATCCT
GCTCATTCTT TGGGAGATGC CTTTGGTATG GATCTATGTC ATGAACCATC TATAATTAGT
TCTAACCTTA AGGGACAAGA AATAGAAGCA AATAAAGTTC TTGAAAAATT TCGAGAGGAT
TATTTGTGGG AATTAGCAGA AATGATGAGC GGAGAAAAAT CCGAAAATCA AGCCTCTTTT
GAAATGGCTT TTGCCCCTAA AGGATGGCGT CAAATTGTTG AGCAAGCTTT ACCTGGTATT
GATGAAATAT TATCATTTAT AACAGTTATT GAGTTATTGG AGGAAAAACA AGAAGATTTA
ATTGTTTTAG ATACAGCTCC CACAGGGCAT TTATTACGTT TTCTAGAAAT GCCCACTGCA
ATACAGGATT GGTTAGGCTG GATTTTTAAA TTATGGATAA AATATCAAGA TATTATTGGT
AAAACTGATT TTATGGGACG TTTGCGAACT TTACGCCAAC GGGTGGTTAA AGCACAAAAA
ATTCTCAAAG ATCCAAAAAA AACTGAGTTT ATTGGAGTAA TACGTCCTCA AAAAGGAGTT
ATTGCGGAAG CAGAACGTCT TTATAAATCT CTTGCAGAAA TGCACATACC ACAAAATTAT
TTAGTTTTAA ACTGTTTTAC CTCTAATAGC GTTATCCCTA CTGACCAATT TCCGGGGGTA
CAATTTGTTT GTATGCCAAT GTTACCTCGT TCAATTGAAC CTATTGAACA AATTAAAGGA
GCAGCTACTT ATATATTCTA A
 
Protein sequence
MYSFDNLNLV MFSGKGGVGK TTNSCAFAGH WAKKSPNEKV LLISTDPAHS LGDVLQISVT 
DTPRPLEDLP NLLVRALDVN LLLEEFKKRY GDVLEVIVER GSFVEGGDLT PVWDLDWPGL
DELMALLEIQ RLCNEKVVDR VVVDMAPSGH TLNLFKLMDF LDTFLNSLEL FQEKHKYIKK
SFSGSYIPNE VDEVLQNLKD ELAAGRHLLQ NSSNTACLVV ALPEPMSFRE TQRFLSSLEE
IKIPYAGIVV NQIVVDKDGN SDRYHEQQKL VNDFIKLAGD KPVFLVPQEK AEPLGVTALE
KLTNQINQAT IQEFSTSISF NIQWPEKVPP GFTDFIGKGK RLLIVGGKGG VGKTTIAAAI
SWEMAKRYPE RQVRAVSIDP AHSLGDAFGM DLCHEPSIIS SNLKGQEIEA NKVLEKFRED
YLWELAEMMS GEKSENQASF EMAFAPKGWR QIVEQALPGI DEILSFITVI ELLEEKQEDL
IVLDTAPTGH LLRFLEMPTA IQDWLGWIFK LWIKYQDIIG KTDFMGRLRT LRQRVVKAQK
ILKDPKKTEF IGVIRPQKGV IAEAERLYKS LAEMHIPQNY LVLNCFTSNS VIPTDQFPGV
QFVCMPMLPR SIEPIEQIKG AATYIF