Gene Tery_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1473 
Symbol 
ID4241679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2235212 
End bp2236372 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content36% 
IMG OID638106626 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_721236 
Protein GI113475175 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0646926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAA TGAAAATGAA TGAAGAGGAA AAAAAACAAC AAAACAAAAT TCAGCGAGTG 
GGTGTTGTTG GTGGTGGTCA ATTAGCATGG ATGATGGGGG ATGCAGCAAA AAAACTAGGA
GTAGATTTAA TTATTCAAAC TCCTCATCAA GATGACCCAG CAGTATCTAT TGCAAAGGAT
ATAATTTTGG CAGAAATTGA TGACCCTCAG GCAACTACTA AGTTAGCAAA TATTTGTGAT
GTGATTACCT TTGAGAATGA GTTTATTAAT ATAGATGAGT TATCATATCT TGCTGAAAAG
AATGTAATTT TTCGTCCTAG TTTATCTGTG TTGAAGCCGT TATTAGATAA ATATGAACAG
TTATGTTATT TACGATATTT GGGTTTACCT GTACCGAACT TTTGGGAGTG GGGAATAGAG
ATTGAGCCTT TATCTTTCCC TTTGGTATTG AAAGCTCGTC GTCATGGTTA TGACGGTCAG
GGTACTTTTA TTATTAAGAA TATTGAGAGT TTAAAATCTC AGGGAAATTC AGAATTTTTC
ATACAAGAAT TTATTCCTTT TGAACGAGAG GTTGCTGTTA TTGCTGCTCG TGGAGTTACT
GGGGAAGTTA AGGTTTATCC TGTGGTAGAA ACTCAACAAG AAAACCAGGT TTGCCAGCGG
GTTTTTGTAC CTGATGAAAA TTTAGAATTA GTAACAGAAA TTGAAGAGAT CGCTCAGACT
CTCCTTAATA GTTTAGAAGT AGTAGGAGTA TTTGGGATAG AAATGTTTAT TACTAAAGAC
AAGAAGGTTT TAATTAATGA AATTGCTCCA AGAACTCATA ACTCTGGTCA TTACAGTTTG
GATGCTTGTG AAGTTTCCCA GTTTGAACAA CATTTACGGG CTGTTTGTGG TTTACCTTTA
GGTAATACTA CTCTCAAAGT AAGGAGAGCG GTGATGGTAA ATTTATTGGG TTATGAATTT
GGGGAAAATT ACTACTTGAC AAAACGGCAA ATGTTAGAAA AAATTCCTCA TGCTTCGGTT
TGGTGGTATG GCAAAACAGA ATCTCGCCCA GGACGGAAGT TAGGTCATGT GACTGTTTTG
CTAGATGAGG AAAATTTTGA AATTATGGGT CGCAAGGGTG AGGCGATCGC TAATAAAATA
GAGAATATCT GGTACACCTA A
 
Protein sequence
MVKMKMNEEE KKQQNKIQRV GVVGGGQLAW MMGDAAKKLG VDLIIQTPHQ DDPAVSIAKD 
IILAEIDDPQ ATTKLANICD VITFENEFIN IDELSYLAEK NVIFRPSLSV LKPLLDKYEQ
LCYLRYLGLP VPNFWEWGIE IEPLSFPLVL KARRHGYDGQ GTFIIKNIES LKSQGNSEFF
IQEFIPFERE VAVIAARGVT GEVKVYPVVE TQQENQVCQR VFVPDENLEL VTEIEEIAQT
LLNSLEVVGV FGIEMFITKD KKVLINEIAP RTHNSGHYSL DACEVSQFEQ HLRAVCGLPL
GNTTLKVRRA VMVNLLGYEF GENYYLTKRQ MLEKIPHASV WWYGKTESRP GRKLGHVTVL
LDEENFEIMG RKGEAIANKI ENIWYT