Gene Nham_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2041 
Symbol 
ID4029304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2263851 
End bp2264888 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID637970498 
Productallophanate hydrolase subunit 2 
Protein accessionYP_577299 
Protein GI92117570 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC TTGTCGCCGC TGCTGCCGGG CCCGCGACGT CGGTTCAGGA CGGCGGGCGC 
TACGGCGTGC AGCGCTATGG CCTGACGCCG AGCGGCGCGG TGGATCGTCT CGCGCTGGCT
GCCGCCAATT GCCTGGTCGG CAATCCGCCC TTTGCCGCTG CGATCGAAGT CGGGCCGTTC
GGCGCGGCTT TCGTCGCCCG CGAGGGCAAG GTGCGCGTCG CGCTCGCGGG CGCCGTGCGG
AACGCTGAAG TCGCGGGGCA CCCGGTGTCG TTCAACGAAT CCCGCACGCT CGGTGACGGC
GAAAGCCTGA CGCTCGGCTT CGCACGCGAC GGGACCTTCA GCTATCTCGC CATTGAGGGT
GGCGTGAGAG GCGAGCCGAC GTTCGGCAGC CTCGCCGTCA ACGCGCGTGC CGGCCTTGGC
AGTCCGTTTC CGCGGCCATT GCAGGCCGGC GACGTCCTTG ATGTCGATGC TGCAAAGGCT
ACGATCGAGC GGCGGATCGA CCTGCCCGCC GTATCCGATG GTCCGATCCG CGTCGTGATG
GGTCCGCAGG ACGACGAATT CGGCGAGGCG ACGGATCTGT TCCTCCGCAG CGAGTGGAAG
ATATCGGCGA CAAGCGACCG CATGGGCTAT CGCCTTGAAG GACCCGTTAT CAAGCATCTG
CATGACCACA ACATCGTCTC CGACGGCACC GTGAACGGCA GCATTCAGGT TCCCGGCAAC
GGACAGCCGA TCGTGCTGAT GCCGGATCGC GGCACCAGCG GCGGCTATCC GAAAATCGCG
ACCGTGATCA CCGCCGACCT CGGTCGCTTC GCACAAATCC CCGCCGGCCA CACCTTCCGC
TTCCAGGCGG TCACCATGAC TGATGCCCAG GCCGCGGCGC GCGCGATGGC GGACCTGTTG
CAAACCCTCC CCGATCGCGC CCGCGAGGTG CGCAATGTCG ACATCAGCGA CGCGCTGCAG
AACGCCAATA TCGCCGGCTC TGCGGTGAAT GCATTCGACA GCGGAACGTG GCAAACTTGG
ACACCTCCGG AGCCATAG
 
Protein sequence
MSKLVAAAAG PATSVQDGGR YGVQRYGLTP SGAVDRLALA AANCLVGNPP FAAAIEVGPF 
GAAFVAREGK VRVALAGAVR NAEVAGHPVS FNESRTLGDG ESLTLGFARD GTFSYLAIEG
GVRGEPTFGS LAVNARAGLG SPFPRPLQAG DVLDVDAAKA TIERRIDLPA VSDGPIRVVM
GPQDDEFGEA TDLFLRSEWK ISATSDRMGY RLEGPVIKHL HDHNIVSDGT VNGSIQVPGN
GQPIVLMPDR GTSGGYPKIA TVITADLGRF AQIPAGHTFR FQAVTMTDAQ AAARAMADLL
QTLPDRAREV RNVDISDALQ NANIAGSAVN AFDSGTWQTW TPPEP