Gene Mpe_A1262 details

Gene Information

Locus tag: Mpe_A1262
Symbol:
ID: 4785839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1358851
End bp: 1362123
Gene Length: 3273 bp
Protein Length: 1090 aa
Translation table: 11
GC content: 67%
IMG OID: 640089828
Product: carbamoyl-phosphate synthase large subunit
Protein accession: YP_001020459
Protein GI: 124266455
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
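
The gene length and protein length listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1358851, 1362123)
print(length)                     # 3273
print(protein_length_aa(length))  # 1090
```

Both results agree with the Gene Length and Protein Length fields of this record.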


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0303508
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCCAAGC GCACCGACCT TCAAAGCATC CTCATCATCG GCGCCGGCCC GATCATCATC 
GGTCAGGCCT GCGAGTTCGA CTACTCCGGC GCTCAGGCCT GCAAGGCGCT GCGCGAGGAG
GGCTATCGCG TCATCCTCGT CAACAGCAAT CCGGCGACGA TCATGACCGA CCCGGAGATG
GCCGATGCCA CCTACATCGA GCCGATCACG TGGTCGGTGG TCGAAAAGAT CATCGCCAAG
GAGCGCCAGA CGCATCCCGA CGAGAAGATG GCGATCCTGC CCACGATGGG CGGCCAGACC
GCGTTGAACT GCGCCCTCGA CCTGCACAAG CACGGTGTGT TGTCCAAGTA CGGCGTCGAG
ATGATCGGCG CCAACGAGCA CGCGATCGAG AAGGCCGAGG ACCGCCTGAA GTTCAAGGAC
GCGATGACCG GCATCGGCCT GCATTCGGCC AAGAGCGGCA TCGCGCACTC GATGGAGGAG
GCGCTGGCCG TCCAGCGTCG GATCACGAGC GACATCGGCG GCACCGGTTT CCCGATGGTG
ATCCGCCCGA GCTTCACGCT CGGCGGCACC GGCGGCGGCA TCGCCTACAA CCCCGAGGAG
TTCGAGGAGA TCTGCAAACG CGGCCTCGAC CTCTCGCCGA CCAAGGAACT GCTCATCGAA
GAGAGCCTGA TCGGCTGGAA GGAGTACGAG ATGGAGGTGG TCCGCGACAA GGCGGACAAC
TGCATCATCG TCTGCTCGAT CGAGAACCTC GATCCGATGG GCATCCACAC CGGCGACTCG
ATCACCGTGG CGCCGGCCCA GACGCTGACC GACAAGGAAT ACCAGCTGCT GCGCAACGCC
AGCATCGCGA TCCTGCGCGA GATCGGCGTC GACACCGGCG GCTCCAACGT GCAGTTCTCG
ATCAACCCGG ACAACGGCCG CATGGTCGTG ATCGAGATGA ATCCGCGCGT GTCGCGGTCG
TCGGCCCTGG CTTCAAAGGC CACCGGATTC CCGATCGCGA AGGTGGCCGC CAAGCTGGCG
GTCGGCTACA CGCTCGACGA ACTGCGCAAC GACATCACCG GCGGCGCGAC GCCGGCGAGC
TTCGAGCCCA GCATCGACTA CGTCGTCACC AAGATCCCGC GCTTCGCGTT CGAGAAGTTC
CCGGCCGCCG ACTCCCGCCT GACCACGCAG ATGAAGTCGG TGGGCGAGGT GATGGCGATG
GGCCGCAGCT TCCAGGAGAG TTTCCAGAAG GCGCTGCGCG GTCTCGAGAC CGGCATCGAC
GGCCTGTCCG AGCGCAGCAC CGACCGCGAG GAGATCGTCC AGGAGATCGG CGAGGCGGGT
CCGGAGCGCA TCCTCTATGT CGCCGACGCC TTCCGCATCG GCCTGAGCCG CGACGAGATC
TTCGAGGAAA CCGCGATCGA CCCATGGTTC CTGGCACAGA TCGAGCAGCT CGTGCAGGCG
GAGCTGGCGC TGAAGGGCCG CACGCTGGCC AGCCTGTCGA CGGACGAGCT GCGCTTCCTC
AAGCGCAAGG GCTTCTCCGA CAAGCGCCTG GCCAAGCTGC TCGGCACCCA CCAGCATGAG
GTGCGCGCCG CGCGCCACGC ACAGGGCGTG CGGCCGGTCT ACAAGCGCGT CGACACCTGT
GCGGCCGAGT TCGCCACGCA GACCGCCTAC ATGTACTCGA CGTACGACGA CGAGTGCGAG
GCGCAGCCGA CCGACCGCAA GAAGATCATG GTGCTCGGCG GCGGCCCGAA CCGCATCGGC
CAGGGCATCG AGTTCGACTA CTGCTGCGTG CACGCCGCGC TGGCGATGCG CGAGGACGGC
TACGAGACCA TCATGGTCAA CTGCAACCCC GAGACCGTGT CGACCGACTA CGACACCTCG
GACCGGCTTT ACTTCGAGCC GGTGACGCTG GAAGACGTGC TGGAGATCGT CGACAAGGAA
AAGCCGGTCG GCGTGATCGT GCAGTACGGA GGCCAGACGC CGTTGAAGCT GGCGCTCGAC
CTGGAGCGCG CCGGCGTGCC GATCGTCGGC ACCTCGCCGG ACAGCATCGA CATCGCGGAA
GATCGCGAGC GCTTCCAGCA ATTGCTGCAC AAGCTCGGTC TGAAGCAGCC TCCGAACCGC
ACCGCGCGCA CCGAGGAGGC CGCGCTGCAG CTGGCGCAGG AGATCGGCTA CCCCCTGGTG
GTGCGCCCGA GCTACGTGCT GGGCGGCCGT GCGATGGAGA TCGTGCATGG CGACAAGGAC
CTCGAGCGCT ACATGCGCGA GGCGGTCCGC GTGTCCGAGA AGTCGCCGGT GCTGCTCGAC
CGCTTCCTCG ATGACGCGGT GGAGGTCGAT GTCGACTGCA TCTCCGACGG CCACGACGTG
ATGATCGGCG CGATCATGGA GCACATCGAG CAGGCCGGCG TGCACTCCGG CGACTCGGCG
TGCTCGCTGC CGCCGTATGC GCTGAGCCCG GCGCTGCAGG ATGAACTGCG GCGCCAGACC
GCGGCGATGG CGAAGGCGCT GCAGGTCGTG GGCCTGATGA ACGTGCAGTT CGCGATCCAG
GGCGAGGGCG ACGAGGCGGT CGTCTACGTG CTCGAGGTCA ACCCGCGCGC GTCGCGCACC
GTACCCTTCG TCTCCAAGGC CACCGGCCAG CCGCTGGCCA AGATCGCGGC GCGCTGCATG
GTCGGCCAGA AGCTGGCCGC GCAGAAGGCC CTGAGGGGCG AGGCGCCGCG CGAGATCGTG
CCCTCGTACT ACAGCGTCAA GGAAGCGGTG TTCCCGTTCA ACAAGTTCCC CGGCGTCGAT
CCCATCCTCG GCCCCGAGAT GCGCTCGACC GGCGAGGTGA TGGGGGCGGG CCGCAGCTTC
GGCGAGGCCA TGCTCAAGAG CCAGCTCGGC GCCGGCTCGC GCCTGCCGTC CAGGGGCACG
GTCGTCATCA CGGTGAAGAA CGCCGACAAG GATCGCGCGG TCAAGATCGC CGGCGATCTG
GTCGATCTCG GCTTCAACGT CGTCGCCACC AGGGGCACGG CAGCGGCCAT CTCGGCGGCC
GGCGTGCCGG TGAAGGTGGT CAACAAGGTC AAGGACGGTC GGCCGCACAT CGCCGACATG
ATCAAGGCCG GCGAGATCCA GCTGGTATTC ACGACCGTCG ACGAGACCCG TACCGCGATC
GCGGATTCGC GCTACATCCG CACCGCGGCG ATCGCCAACC GCGTCAGTTA CTACACGACC
ATGGCCGGCT GCGAGGCCGC GGTCGAGGCA CTGAAGCATC AGGACGATCT GACCGTGCTT
TCCCTGCAGG AACTGCACGC GGAACTCCAC TAA
 
Protein sequence
MPKRTDLQSI LIIGAGPIII GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIT WSVVEKIIAK ERQTHPDEKM AILPTMGGQT ALNCALDLHK HGVLSKYGVE
MIGANEHAIE KAEDRLKFKD AMTGIGLHSA KSGIAHSMEE ALAVQRRITS DIGGTGFPMV
IRPSFTLGGT GGGIAYNPEE FEEICKRGLD LSPTKELLIE ESLIGWKEYE MEVVRDKADN
CIIVCSIENL DPMGIHTGDS ITVAPAQTLT DKEYQLLRNA SIAILREIGV DTGGSNVQFS
INPDNGRMVV IEMNPRVSRS SALASKATGF PIAKVAAKLA VGYTLDELRN DITGGATPAS
FEPSIDYVVT KIPRFAFEKF PAADSRLTTQ MKSVGEVMAM GRSFQESFQK ALRGLETGID
GLSERSTDRE EIVQEIGEAG PERILYVADA FRIGLSRDEI FEETAIDPWF LAQIEQLVQA
ELALKGRTLA SLSTDELRFL KRKGFSDKRL AKLLGTHQHE VRAARHAQGV RPVYKRVDTC
AAEFATQTAY MYSTYDDECE AQPTDRKKIM VLGGGPNRIG QGIEFDYCCV HAALAMREDG
YETIMVNCNP ETVSTDYDTS DRLYFEPVTL EDVLEIVDKE KPVGVIVQYG GQTPLKLALD
LERAGVPIVG TSPDSIDIAE DRERFQQLLH KLGLKQPPNR TARTEEAALQ LAQEIGYPLV
VRPSYVLGGR AMEIVHGDKD LERYMREAVR VSEKSPVLLD RFLDDAVEVD VDCISDGHDV
MIGAIMEHIE QAGVHSGDSA CSLPPYALSP ALQDELRRQT AAMAKALQVV GLMNVQFAIQ
GEGDEAVVYV LEVNPRASRT VPFVSKATGQ PLAKIAARCM VGQKLAAQKA LRGEAPREIV
PSYYSVKEAV FPFNKFPGVD PILGPEMRST GEVMGAGRSF GEAMLKSQLG AGSRLPSRGT
VVITVKNADK DRAVKIAGDL VDLGFNVVAT RGTAAAISAA GVPVKVVNKV KDGRPHIADM
IKAGEIQLVF TTVDETRTAI ADSRYIRTAA IANRVSYYTT MAGCEAAVEA LKHQDDLTVL
SLQELHAELH
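
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be cleaned before any analysis. A minimal sketch, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence above:

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence shown above (not the full gene).
first_line = "ATGCCCAAGC GCACCGACCT TCAAAGCATC CTCATCATCG GCGCCGGCCC GATCATCATC"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field of this record.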