Gene Tery_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_5003 
Symbol 
ID4246658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7644229 
End bp7645377 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content40% 
IMG OID638109814 
Productphosphonate metabolism protein PhnM 
Protein accessionYP_724390 
Protein GI113478329 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3454] Metal-dependent hydrolase involved in phosphonate metabolism 
TIGRFAM ID[TIGR02318] phosphonate metabolism protein PhnM 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.849449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTT ATCTTACCCA CTGTCGTTTA ATTACTAATA ATGCAGTTGT TGATGATGCT 
GCTGTGTTAA TTGAAGATGG ATATATTGTC GCTATCAATC CCGAATTTAC TAACAATGTT
GAATCTATTT CCCTAAATGG TCAATATTTA TTACCTGGGT TAGTAGATCT TCATTGCGAT
GCCATTGAAA AAGAAATTGA ACCTCGTCCC AATGCTTTTT TCCCTATGGA TTTTGCGATC
GCTCAAATAG ATCGAAATAA TGCTGCTGTT GGTATTACCA CACCTTTCCA TGCTATCTCC
TTTGCCTATG AAGAATTTGG CCTTCGCAAC AATGAAAAAG CAGCTCAAAT TGTGCGTTCC
CTCCACAATT ATCAGCCCCA AGCATTAGTT AATAACCGGG TCCATTGTCG CTACGAAATT
ACCGACCCTA CAGGGCTACC CATTTTGCTT AATCTGTTGC AGTCAGATGA CATTCATTTG
ATTTCTTTTA TGGACCATAC TCCAGGACAG GGACAATTTA AAAATGTGCA AGCATACCAG
GATTATTTGG CCCGCGCATA CAACAAATCT GCTACAGAAG TCGAAGCAAT AGCCCTCAAA
AAAATCGATC AAGGAGCAGA TGCTCTGGAA CGGGTAAAAA CTTTAATTTC CAAAGCTTTA
TCTTTAGGAG TACAAGTTGC TAGTCATGAT GATGATAGCC CAGAGAGAAT TTCTAGTATG
CAGGCTTTGG GAATACATCT TAGTGAATTT CCGATCAATC TTGAAACGGC CCAAGCTGCT
AAAAAAGCCG GACTCCAAAC CATATTTGGT GCCCCTAATT TACTACGGGG ACAAAGTCAG
AGTGGTTCAA TAAAAGCCAT AGATGCAATT AAACATCACG TGGGAGATAT TCTTTGTGCA
GATTACTCAC CTGCAAGTTT GCTGGCAGCA GCATTTCGAA TTCCTGAATT ACTTGGTTGG
TCATTACCAG ATGCAATAGC CCTTGTTACA CACAACCCTG CACAAGCTGT AAATCTTAGT
GACCGCGGTG AAATTGCTAT AGGCAAACGG GCTGATTTAA TTGTTGTACA GTGTCCTCAT
GGCTTTCCTC AAGTAACAAC TACTTGGGTT GGGGGGCGAA TTGTTTACCA ATGTCATTAC
TCAAGATAA
 
Protein sequence
MKTYLTHCRL ITNNAVVDDA AVLIEDGYIV AINPEFTNNV ESISLNGQYL LPGLVDLHCD 
AIEKEIEPRP NAFFPMDFAI AQIDRNNAAV GITTPFHAIS FAYEEFGLRN NEKAAQIVRS
LHNYQPQALV NNRVHCRYEI TDPTGLPILL NLLQSDDIHL ISFMDHTPGQ GQFKNVQAYQ
DYLARAYNKS ATEVEAIALK KIDQGADALE RVKTLISKAL SLGVQVASHD DDSPERISSM
QALGIHLSEF PINLETAQAA KKAGLQTIFG APNLLRGQSQ SGSIKAIDAI KHHVGDILCA
DYSPASLLAA AFRIPELLGW SLPDAIALVT HNPAQAVNLS DRGEIAIGKR ADLIVVQCPH
GFPQVTTTWV GGRIVYQCHY SR