Gene Tery_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0231 
Symbol 
ID4242385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp359583 
End bp361217 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content38% 
IMG OID638105575 
ProductPpx/GppA phosphatase 
Protein accessionYP_720192 
Protein GI113474131 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.448988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACAT CAGTTCCTTT AGTTAGATTT CCTGCTCCAC TAGCAGGCCA AAACCCGATA 
TTAGCAGCAA TAGATATTGG TACAAATTCG TTGCACTTGG TTGTAGTAAA AATTGATCCT
AATTTGCCGA CTTTTACTAT TATTGACCAA GATAAAGAAA CAGTAAGACT AGGAGAGTGC
GGCACGAAAG GTAACTTAAA ACCAGAAGTG ATGGATAGAG CGATCGCTAC TTTAGAGCGT
TTCCAACAAA TTGCTAAAAG TGCTAATGCT AAACAAATCA TTACAGTTGC TACTAGTGCT
GTAAGAGAAG CACCTAACGG TAAAGAATTT CTCAATAGAA TAGCTGATGA GTTAAACCTA
TATGTTGACT TGATATCTGG TCAAGAAGAA GCACGACGAA TTTATCTAGG GGTACTTTCA
GCAATGGAAT TTAATAACCA ACCCCATGTT ATTATTGATA TTGGTGGAGG TTCCACAGAG
TTGATCTTAG GTGATAGTGA TACACTGAGA ACTCTAAGTA GTACAAAAGT AGGTGCAGTA
CGTCTGACTA AAGAATTTGT TACCACAAAT CCCGTTAGTA AGAGTGAGTT TGCCTATCTG
CAAGCTTACA TTAGAGGCTT ATTAGAACGC CCGACCAAAA ACATATTAGC TAATATCAAG
AAAGGTGAAA AACCTCAGTT AGTCGGAACT GCTGGTACTA TTGAAGCTTT AGCGACTATT
AATGCTTATG AAAAATTGGG TAATGTACCA GCTCCCCTGG GTGGTTATCA GTTTAGTTTG
ACAGAATTAG AGGAGTTGGT TAATAAGTTG AGGAAGTTAC CTATTTCTAA AAGACGAGAA
ATTCTGGGAA TGTCCGAAAA GCGAGCAGAA ATTATTTTGG CGGGTGCTTT AGTGTTACAC
GAAGCAATGA GTTTATTAGA GATGGAGTCG GTGACTGTGT GTGAAAGCAG TTTGCGAGAG
GGGGTAATAG TTGATTGGAT GTTGAATCAT GGTTTGATTG AGGATCACCT TCGTTTCCAA
AGTTCAATTC GCCAACGAAA TACTCTAAAA ATTGCGCAAA AATACCAGGT TAATTTGGAG
TATAGCGAAC GGGTTGCTTT TTGGGCGTTA TATTTATTTG ACCAAACTAT GGGAGTTCTG
CATAACTGGG GAAGTGAGGA ACGAGAATTG TTATGGTCAG CAGCAATTTT GCATAATTGT
GGTATATATG TAAATCATTC AGAACATCAT AAACATTCCT ACTATTTGAT CAGAAATGGG
GAGTTATTGG GGTATACTCA AATTGAAATT GAAGTTATTG CTAATTTAGC TCGTTATCAC
CGCAAGAGTT TATGCAAGAA AAAACACGAC CATTATCAAA TTTTACCCAA AAGATATCAA
GAAATGGTGT CTCAGTTGAG TTCATTGTTA CGTTTAGCGG TAGCTTTAGA TAGGCGACAA
AAAGGGGCGA TAGCTAATTT GACCTGTTGG TTAAACACAA AGCAACAGGA ATTTCATCTC
TGGTTACGGC CTGCTAACCC TAAAGATGAT TGTGCTTTAG AATTGTGGAG TTTGGAAAAT
AAGAAGGAGG CCTTTGAAAA AGAGTTTGGT TTAAAATTAA TAGTAAATTT AGAATCTGCC
TCGTTAGTAA CTTGA
 
Protein sequence
MITSVPLVRF PAPLAGQNPI LAAIDIGTNS LHLVVVKIDP NLPTFTIIDQ DKETVRLGEC 
GTKGNLKPEV MDRAIATLER FQQIAKSANA KQIITVATSA VREAPNGKEF LNRIADELNL
YVDLISGQEE ARRIYLGVLS AMEFNNQPHV IIDIGGGSTE LILGDSDTLR TLSSTKVGAV
RLTKEFVTTN PVSKSEFAYL QAYIRGLLER PTKNILANIK KGEKPQLVGT AGTIEALATI
NAYEKLGNVP APLGGYQFSL TELEELVNKL RKLPISKRRE ILGMSEKRAE IILAGALVLH
EAMSLLEMES VTVCESSLRE GVIVDWMLNH GLIEDHLRFQ SSIRQRNTLK IAQKYQVNLE
YSERVAFWAL YLFDQTMGVL HNWGSEEREL LWSAAILHNC GIYVNHSEHH KHSYYLIRNG
ELLGYTQIEI EVIANLARYH RKSLCKKKHD HYQILPKRYQ EMVSQLSSLL RLAVALDRRQ
KGAIANLTCW LNTKQQEFHL WLRPANPKDD CALELWSLEN KKEAFEKEFG LKLIVNLESA
SLVT