Gene Acid345_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4149 
Symbol 
ID4072340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4908078 
End bp4909367 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID637986180 
Productdihydroorotase 
Protein accessionYP_593223 
Protein GI94971175 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.171454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCTA CATCTGTTTT GATCCGGCGC GGGCATGTAA TTGACCCGGC GAACAACATT 
GACCGTCCCA TGGACGTACT CCTGCGCGAA GGACGCGTGG CGGCGATTAC CGAACCCGGG
GGCATCAAGT CCGAATACGA AGAAGAGTTT GACGCGAACC ACCTGGTGGT GGCGCCGGGC
TTTATTGACC TGCATGTGCA CCTGCGCGAG CCGGGGCAGG CGCACAAGGA AACCATTGCG
AGCGGCACGC GCTCGGCGGC GGCGGGCGGC TTTACGTCCG TCTGCGCGAT GCCCAACACT
TCGCCGGTGA ATGACACTCC GGAGACCACC ACGTGGATGC TGCAGCCGGA CCGTGGCGCG
GTAGTGAACG TCTTCCCGAT TGCCGCGGCC ACCATCGGAA GCAACGGTGA AAAGCTCACC
AACTTCCGCG ATTTACAGCG CGCGGGTGCG GTGGCGATCA GCGATGACGG CAAGCCGATC
CTGGACGACA ACCTGATGCG GGAGGCGCTG CGCACCGCGG CGCGGCTGGA GATGCCAGTG
GTGCAGCACG CGGAAGATCC TCGGATGCAT CCGGGCGGCT GCATGAATTA CGGTGTGACT
TCGTTGCGGC TGGGACTGCG CGGCATCCCG AATGCGAGCG AAGCCAGCGT GGTGCTGCGC
GATATCCGGC TCACGCGCGA GTCGCGCGCG CACTTGCACG TGGCGCATAT CTCCACGGCC
GAGGCGCTTG ACGCCGTGCG CCGGGCGAAG AAAGAAAACT TGCGTGTGAC CGCCGAGGTT
ACGCCGCACC ACTTCACGCT GCTCGACGAA AACATTGGCC ACTACGACAC GGCATACAAG
ATGAATCCGC CGCTACGCGC GAACCCGGAC CGCGACGCGA TGATTGCCGG CCTGAAAGAC
GGCACGCTCG ATTGCATTGC CACCGACCAT GCACCGCACG CGTATCACGA GAAAGAACAG
GAATTCGACC GCGCGCCCTT CGGCATTATC GGCCTCGAGA CGGCGCTGCC GCTGGCGATT
ACCGTGTTGC ACAAGCACTT CGAAATTCCG CTCACGCGGA TCGTGCAACT GATGAGCACC
AGTCCGGCGC GGCTTTTCCA ACTCATGCAT CGCGGCTCGC TGGCGGTTGG TTCGCATGCC
GACGTCGTGG TCTTCGATCC GAAGATGAAG TGGAAGTTCG AGGCGGCGAA GGGCCACTCG
AAATCGAAGA ACACACCGTT CGACGGCTGG GACTTCATGG GCAAGGTGAT GGCGACGATT
GTGGGCGGAA GACCGGTTTA TCTGGCGTAA
 
Protein sequence
MTSTSVLIRR GHVIDPANNI DRPMDVLLRE GRVAAITEPG GIKSEYEEEF DANHLVVAPG 
FIDLHVHLRE PGQAHKETIA SGTRSAAAGG FTSVCAMPNT SPVNDTPETT TWMLQPDRGA
VVNVFPIAAA TIGSNGEKLT NFRDLQRAGA VAISDDGKPI LDDNLMREAL RTAARLEMPV
VQHAEDPRMH PGGCMNYGVT SLRLGLRGIP NASEASVVLR DIRLTRESRA HLHVAHISTA
EALDAVRRAK KENLRVTAEV TPHHFTLLDE NIGHYDTAYK MNPPLRANPD RDAMIAGLKD
GTLDCIATDH APHAYHEKEQ EFDRAPFGII GLETALPLAI TVLHKHFEIP LTRIVQLMST
SPARLFQLMH RGSLAVGSHA DVVVFDPKMK WKFEAAKGHS KSKNTPFDGW DFMGKVMATI
VGGRPVYLA