Gene Meso_2043 details

Gene Information

Locus tag: Meso_2043
Symbol:
ID: 4181436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 2195054
End bp: 2196304
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 64%
IMG OID: 638067939
Product: allantoate amidohydrolase
Protein accession: YP_674601
Protein GI: 110634393
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
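
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2195054
END_BP = 2196304
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 2196304 - 2195054 + 1 gives 1251 bp, and 1251 / 3 - 1 gives 416 aa.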


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.801178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCAC CCGGCGAAAA TCTGAGGATC AATGCGGACC GCCTCTGGGA TTCCATTCAC 
GAGATGGCCG CGATCGGCCC CGGCATCGCC GGCGGCAACA ACCGGCAGAC GCTGACCGAC
GAAGACGGCC AGGCACGGCA CCTCTTCAAG AAGTGGTGCG AGGAAGCCGG CATGTCCGTC
TCCGTGGACG CCATGGGCAC TATGTTCGCG CAGCGCGAGG GCACCGACCC CGATGCCCTG
CCCGTCTATG TGGGCTCCCA TCTCGATACG CAGCCGACGG GCGGCCGCTA TGACGGCGTG
CTGGGCGTCC TCGGCGGGTT GGAGGTGATC CGCAGCCTCA ATGACCTCGG CATCAAGACG
AAGCACCCGA TCGTCGTCAC CAACTGGACC AACGAGGAAG GCACCCGCTT CGCCCCGGCC
ATGCTCGCCT CCGGCGTCTT TGCCGGCATG CATGATCTCG AATGGGCCTA TGACCGGAGG
GATGCGCAGG GAAAGCGCTT CGGCGACGAG CTGGAGCGCA TCGGCTGGAA GGGCGAAGAG
CCGGTCGGCG GCCGCAAGAT GAAGGCCTTC TTCGAGCTCC ACATCGAGCA AGGCCCGATT
CTGGAGGACG AGGGGATTGA TATAGGCGTC GTCACCCACG GCCAGGGGCT CAAATGGCTC
CAGGTGACGC TCTCCGGCCG CGAGAGCCAT ACCGGCTCGA CGCCCATGCC CAAGCGGCGC
AACGCCGGGC TCGGCATGGC CCGCGTGATC GAGCTCGTCC ATGAAGTGGC GATGGACTAC
CAGCCCCACG CCGTGGGCGC CGTCGGCCAC ATGGAGGTCT ATCCCAATTC CCGCAACATC
ATCCCGGGCC AAACGGTTTT CACCATAGAC ATTCGCTCGC CCGACAAAAA AGTGCTCGAC
ATGATGGATG CGCGCATCCG ACAGGGCATT GCGACCATTT GCGATGCGAT GGATATTACC
TCCGAAATCG AGCAGGTCGG GCATTTCGAT CCCGTCACCT TCGACAAGGG ATGCGTCGAG
GCAATCCGCA AGGCCGCCGA ACGGCTCGGA TACACGCACC GCGATATCGT CTCCGGCGCC
GGGCATGATG CCTGCTGGAT CAACCGCGTG GCCCCCACGG CCATGGTCAT GTGCCCCTGC
GTCGACGGCC TCTCCCACAA CGAAGCCGAG ATGATCACCA AGGAATGGGC GCAGGCTGGC
GCCGATGTGC TTTTCCATGC TGTGGTGGAG ACGGCGGAAA TCGTTGAATG A
 
Protein sequence
MAAPGENLRI NADRLWDSIH EMAAIGPGIA GGNNRQTLTD EDGQARHLFK KWCEEAGMSV 
SVDAMGTMFA QREGTDPDAL PVYVGSHLDT QPTGGRYDGV LGVLGGLEVI RSLNDLGIKT
KHPIVVTNWT NEEGTRFAPA MLASGVFAGM HDLEWAYDRR DAQGKRFGDE LERIGWKGEE
PVGGRKMKAF FELHIEQGPI LEDEGIDIGV VTHGQGLKWL QVTLSGRESH TGSTPMPKRR
NAGLGMARVI ELVHEVAMDY QPHAVGAVGH MEVYPNSRNI IPGQTVFTID IRSPDKKVLD
MMDARIRQGI ATICDAMDIT SEIEQVGHFD PVTFDKGCVE AIRKAAERLG YTHRDIVSGA
GHDACWINRV APTAMVMCPC VDGLSHNEAE MITKEWAQAG ADVLFHAVVE TAEIVE
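
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how the protein could be re-derived from the gene sequence, and how a GC fraction could be computed, is given below. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in the start codons they permit, which this sketch does not model). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGCAGCAC CCGGCGAAAA TCTGAGGATC"))  # MAAPGENLRI
```

The first thirty bases translate to MAAPGENLRI, matching the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give the fraction to compare against the 64% reported in the record.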