Gene Mflv_3169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3169 
Symbol 
ID4974490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3349841 
End bp3351208 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID640457392 
Productallantoinase 
Protein accessionYP_001134434 
Protein GI145223756 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.45812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG ATACCCGCGA GCAGTCGAAT CACGCCGATC TCGACCTGGT CGTCCGCGGT 
GAACGCATGC TGACGACAGC GGGAATCGTC GCCCGCGAGA TCGGCATCCG CGACGGTCGC
ATCGTCGCGA TCGAGCCGCT GGGCAGCGGT CTTCCCGGCG CGGAGATCGT CGAGCTCACC
GACGAGCAGG TCATGATCCC CGGCCTCGTC GACACGCACG TGCACGTCAA CGAACCGGGC
CGCACCGAGT GGGAGGGGTT CGACTCCGCC ACCCGCGCCG CCGCGGCGGG CGGCGTGACC
ACGCTGATCG ACATGCCGCT CAACTCGATT CCGCCGACGG TCAACGTCGA CGCACTCAAC
GCCAAACGCG AAGCGGCATC GGGCAAGTTG CACATCGACG TCGGCTTCTG GGGCGGTGCC
ATCCCGGGCA ATACCGGCGA CCTGCGCGGC CTGCACGACG ACGGTGTGTT CGGCTTCAAG
TGCTTCCTGT TGCACTCCGG CGTCGACGAG TTCCCCCACC TCGACGCCGA CGAGATGGAA
GAGGACATGC GCGTCCTGGT GGGCTTCGAC TCCATGATGA TCGTGCACGC CGAGGACTCC
CGCGCGATCG ATCACGCCCC CACCGCCGAG GGCGACCGAT ACAGTCGCTT CCTCGCATCG
CGGCCGCGCG GCGCCGAGAA CGTGGCGATC GCCGAGGTCA TCGAGCGCGC TCGATGGACC
GGCGCCCGCG CGCACATTCT GCATCTGTCG TCCTCGGATG CTCTGCCGAT GATCGCCACG
GCCAAACGCG ACGGCGTCAG GATCACGGTC GAGACGTGCC CGCACTACCT GACACTGCTC
GCCGAGGAGA TCCCCAACGG CGCCACCGCG TTCAAGTGCT GCCCGCCGAT CCGTGAGGCA
TCCAATCGGG AACTACTGTG GCAGGGCCTG ATCGACGGCA CCATCGACTG CATCGTCTCC
GACCATTCAC CGTCGACGAT CGACCTCAAG GACGTCGAGA ACGGCGACTT CGGCGTGGCC
TGGGGTGGCG TCGCCTCGCT GCAGCTCGGT CTGTCGCTGA TCTGGACCGA GGCGAAGCGT
CGCGGTGTGG CGCTCACCCG GGTGATCGAC TGGATGGCCG CCAAGCCGGC CGAACTCGCC
GGCTTGAACA ACAAGGGCAA GATCGCGCTC GGCTACGACG CCGATTTCGC GATTTTCGAG
CCGGAGTCCG CGCAGGTGGT CGACGTGCAC AAACTGCACC ACAGGAATCC GATCACGCCG
TACGACGGGC GCGCGGTGGC CGGCGTCGTG GCGAGCACCT GGCTGCGCGG CACGAAGATC
GACTTCACCA CCCCACGGGG GCGGATGCTG CGACGCGGCG GCGTGTAG
 
Protein sequence
MTADTREQSN HADLDLVVRG ERMLTTAGIV AREIGIRDGR IVAIEPLGSG LPGAEIVELT 
DEQVMIPGLV DTHVHVNEPG RTEWEGFDSA TRAAAAGGVT TLIDMPLNSI PPTVNVDALN
AKREAASGKL HIDVGFWGGA IPGNTGDLRG LHDDGVFGFK CFLLHSGVDE FPHLDADEME
EDMRVLVGFD SMMIVHAEDS RAIDHAPTAE GDRYSRFLAS RPRGAENVAI AEVIERARWT
GARAHILHLS SSDALPMIAT AKRDGVRITV ETCPHYLTLL AEEIPNGATA FKCCPPIREA
SNRELLWQGL IDGTIDCIVS DHSPSTIDLK DVENGDFGVA WGGVASLQLG LSLIWTEAKR
RGVALTRVID WMAAKPAELA GLNNKGKIAL GYDADFAIFE PESAQVVDVH KLHHRNPITP
YDGRAVAGVV ASTWLRGTKI DFTTPRGRML RRGGV