Gene Tery_2801 details

Gene Information

Locus tag: Tery_2801
Symbol:
ID: 4245335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4344185
End bp: 4345183
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 35%
IMG OID: 638107853
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_722450
Protein GI: 113476389
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
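
The length fields above are consistent with the coordinates: the span from 4344185 to 4345183 covers 999 bp, which is 333 codons, or 332 residues plus one stop codon. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4344185, 4345183)
print(length, protein_length(length))  # 999 332
```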


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.840794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.160271
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGATT ATAAACAAGC TGGTGTTGAT GTTGAAGCGG GTCATGAATT TGTTAATAAT 
ATTCGCAATT TGGTTATTAG TACCCACCGA CCGGAAGTTT TAGGAGGTTT AGGTGGGTTT
AGTGGATTAT TTGCTTTACC AAGTGGTTAT AAAGAACCTG TTTTGGTTTC TGGTACTGAT
GGTGTAGGTA CAAAATTAAA ACTAGCAAAT ACTTTGAATT GTCATGATAC TGTTGGTATT
GATCTTGTTG CTATGTGTGT TAATGATGTG TTAACATCTG GGGCGGAACC TCTATTTTTT
TTGGATTATT TGGCAACAGG AAAATTAAAT CAACAACAAT TAACTGAAGT GGTTGCTGGT
ATAGCGGAAG GATGTCGTTT GGCAGGGTGT GCTTTAATAG GGGGTGAAAC TGCAGAAATG
CCTGGTTTTT ATCTCCCTGG TGAGTACGAT CTAGCCGGTT TTTGTGTGGG AATTGTGGAA
AAAAGTTTGA TTTTGGATGG GTCTCAGGTA AAAGTAGGTG ATGTGGCAAT TGGATTAGAA
AGTAGTGGAA TTCATAGTAA TGGTTTTAGT TTGGTGAGAA AGATTATTGA TGAATATAAT
ATTTTTCTAC AAGATAACCT AGATTTTTTG GAGCAAAAAA GTTTGGGAGA GATGTTGTTG
ACTCCAACAA AAATTTATGT TAAACCAATT TTAGCAGCGC AAAAGTCTGG TTTAGAAATT
CATGGTATGG CTAATATTAC TGGGGGTGGT TTACCAGAAA ATTTACCTCG TTGTTTGCCA
GAAAAAATGT CTATTGATAT TGATAAGAGT AGTTGGGAAA TTCCTCCTAT TTTTAAGTGG
ATTTCTGAAG TTGGTAATGT AGATAAAGAA GTAATGTTTA ATACTTTTAA TATGGGGATT
GGTTTTGTAG TTTTAGTGTC ACCAACTCAG GTAGAAATGG CGATAGATTT CTTTTTTTCT
CAGGGAATAA AAGCTTATTG TTTGGGTGAA GTAAGTTAG
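
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any length or GC calculation. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample string is only the first printed line of the gene, not the full sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence only (60 bases).
first_line = "ATGATGGATT ATAAACAAGC TGGTGTTGAT GTTGAAGCGG GTCATGAATT TGTTAATAAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full listing should give a length that matches the Gene Length field and a GC percentage close to the reported GC content.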
 
Protein sequence
MMDYKQAGVD VEAGHEFVNN IRNLVISTHR PEVLGGLGGF SGLFALPSGY KEPVLVSGTD 
GVGTKLKLAN TLNCHDTVGI DLVAMCVNDV LTSGAEPLFF LDYLATGKLN QQQLTEVVAG
IAEGCRLAGC ALIGGETAEM PGFYLPGEYD LAGFCVGIVE KSLILDGSQV KVGDVAIGLE
SSGIHSNGFS LVRKIIDEYN IFLQDNLDFL EQKSLGEMLL TPTKIYVKPI LAAQKSGLEI
HGMANITGGG LPENLPRCLP EKMSIDIDKS SWEIPPIFKW ISEVGNVDKE VMFNTFNMGI
GFVVLVSPTQ VEMAIDFFFS QGIKAYCLGE VS
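
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, a plain standard-code lookup is sufficient here. The following is a minimal sketch under that assumption, with `translate` as a hypothetical helper name. It is checked only against the first 30 bases and the last 9 bases of the gene.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Whitespace in the input is ignored. Alternative start codons (which
    table 11 would read as M at the first position) are not handled.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGATGGATT ATAAACAAGC TGGTGTTGAT"))  # MMDYKQAGVD
print(translate("GTAAGTTAG"))                         # VS
```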