Gene Tery_2179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2179 
Symbol 
ID4242631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3402944 
End bp3404242 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content39% 
IMG OID638107283 
Productdihydroorotase 
Protein accessionYP_721883 
Protein GI113475822 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.805984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.355095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGCA AATTACTTCA AAAAGTCCGA ATATTAGACC CTGTTTACGA CACAGACAGA 
GTTGCTGATG TTTTAATTAT TGATGGAAAA ATAGCCACAG TTGAGGAAAA TATTTCCGAG
TTTCCAACTA AAACAGATAT TCAAGACTGT CAGGGATTAA TTTTAGGTCC TGGTTTAGTA
GATTTATATT GTCATAGCGG AGAACCTGGG TTTGAGGAAC GGGAAACTAT AGAATCTCTA
ATGCAAGCTT CTAAGGCAGG TGGTTTTACC AGGTTAGCAA TTTTGCCGGA TACTTTTCCT
CCTGTGGATA ATTTGTCTGG TTTGGCAAGA TTACAAAATT TGGCAACTCA GGTAAATAAT
AATTCTACTT TTCCTCTCTT TTATTATTGG GGTGCTATTA CTCAGGGTGT AAAAGGTAAG
GAAATGACTG AGTTGGGGGA ATTAGCTGCA TCAGGTGTTA TAGGGTTTGC TGATGGTTTG
GCTCTAGAAA ATCTTGGTTT GTTACGACGA GTTTTGGAAT ATTTGAAACC TCTGAATAAA
TCTGTTGCTT TATTTTGCAA AAATTCTGGG TTAGCAGGTA ATGGGGTGAT GCGAGAGGGG
TATGATTCTA TTCGTTTGGG TTTGCCTGGG GTTTCGACTA TGGCTGAAAC TTCTGCTTTG
GCTGCTGTTT TAGAGTTAGT AGATGCTATT GGTACTCCGG TTCATATTAT GCGGGTCTCT
ACTGCTCGGA GTGTAGAGTT AATTGCAAAT GCTAAAAGTA GGGGTTTGCC TATTACTGCT
AGTACGACTT GGATGCATTT GTTGTTGGAT AGTCTGGCTA TTGAGGGTAA GTCTCTTCTG
GATAATTATT TTTTTCCTTA TGACCCGAAT TTACGTTTGG AACCTCCTCT AGGAAGTCAG
AGCGATCGCT TAACTTTACT TGAAGGTATT AGGGATGGGG TTTTAGATGC TATTGCTATT
GACCATGCTC CTTATACTTA TGAAGAGAAA ACTGTTGCTT TTTCTGAAGC ACCCACAGGA
GCAATAGGAC TACAAATAGC ATTACCTTTA TTATGGCAGA GTTTTGTCAA CACAGGACAG
ATGTCGGCTT TAGAGTTATG GAGATTATTA AGTACGTCTC CTAGCAAATG TTTAGGATTA
ATTCCTGGAG ATATCAGACC CCAAAAGTCA GCAGAAGTGA CTTTATTTGC TCCTCAAGAA
ACTTGGGTAG TAGAAAAACA AACTTTGAAA TCTCGTTCTT TTAATACACC TTGGTTAGGA
AAACAAATTC AAGGTCGTGT CCTAGAGTGG GAGTTTTAA
 
Protein sequence
MNSKLLQKVR ILDPVYDTDR VADVLIIDGK IATVEENISE FPTKTDIQDC QGLILGPGLV 
DLYCHSGEPG FEERETIESL MQASKAGGFT RLAILPDTFP PVDNLSGLAR LQNLATQVNN
NSTFPLFYYW GAITQGVKGK EMTELGELAA SGVIGFADGL ALENLGLLRR VLEYLKPLNK
SVALFCKNSG LAGNGVMREG YDSIRLGLPG VSTMAETSAL AAVLELVDAI GTPVHIMRVS
TARSVELIAN AKSRGLPITA STTWMHLLLD SLAIEGKSLL DNYFFPYDPN LRLEPPLGSQ
SDRLTLLEGI RDGVLDAIAI DHAPYTYEEK TVAFSEAPTG AIGLQIALPL LWQSFVNTGQ
MSALELWRLL STSPSKCLGL IPGDIRPQKS AEVTLFAPQE TWVVEKQTLK SRSFNTPWLG
KQIQGRVLEW EF