Gene Phep_0124 details


Gene Information

Locus tag: Phep_0124
Symbol: carB
ID: 8251209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 146943
End bp: 149759
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 45%
IMG OID: 644933774
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_003090412
Protein GI: 255530040
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
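
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the last codon is a stop codon that contributes no residue.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above.
gene_length = span_length(146943, 149759)
print(gene_length)                           # 2817
print(expected_protein_length(gene_length))  # 938
```

Both results agree with the Gene Length and Protein Length fields of the record.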


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAAAG ACACCTCCAT ACGCTCAGTA CTAATTATCG GATCGGGCCC TATCATCATT 
GGCCAAGCCT GCGAATTTGA CTATTCGGGA TCTCAAGCCG CCTTATCTTT AAAAGAAGAG
GGAATTGAAG TTTCGATCAT CAATTCAAAT CCTGCAACCA TCATGACCGA CAAAGTGATT
GGGGACCATG TTTACCTCTG GCCGCTAACG GTAGATTCTA TTGAGCAGAT TTTGCAGGAA
CGCAAAATTG ATGCTGTACT GCCTACTATG GGTGGACAAA CAGCGTTGAA TCTTTGTATT
GAAGCATCTG AACGGGGCAT TTGGGAAAAA TACGGTGTTA AAGTGATCGG TGTGGATGTT
GCAGCAATCG AGAAAACTGA AAACCGGGAA GCTTTCCGCC AGTTAATGGT TGATATAGGT
GTAGGGGTAG CTACTTCAAA AATTGCCAAC TCTTTCCTGG AAGGTAAAGA AGCAGCCCAG
GAAATTGGTT ATCCATTGGT TATCCGCCCT TCTTATACAT TGGGTGGTTA TGGAGGTGGT
TTCGTACACA AAAAAGAAGA ATTTGACCAG GCTTTAAAAC GCGGGCTTGA GGCTTCTCCT
ACCCATGAGG TACTGGTAGA GCAAGCCGTT TTAGGCTGGA AAGAATATGA ACTGGAGTTG
TTAAGGGACA GTAATGACAA TGTGATCATT ATCTGTTCTA TTGAAAACTT CGATCCAATG
GGTATCCATA CAGGAGATTC CATCACCGTA GCACCGGCAA TGACCTTGTC CGACCGTTGC
TACCAGGATA TGCGTAACCA GGCTATCCGC ATGATGCGCG CCATCGGAAA CTTCGCCGGA
GGCTGTAATG TACAGTTCTC GGTAAATCCC GACAACGATG AGATCATTGC CATCGAGATC
AACCCAAGGG TATCCCGCTC GTCGGCCCTG GCCAGTAAAG CAACAGGATA TCCTATTGCC
AAAATTGCGG CTAAACTGGC TATCGGGTAT AACCTGGACG AACTGGAAAA CCAGATTACA
AAAACAACCT CGGCCTACTT TGAGCCTACT TTAGACTATG TAATCGTAAA AGTACCCCGC
TGGAACTTTG ATAAATTTAA AGGCGCCAAT ATGGAGCTGG GCCTGCAGAT GAAATCGGTA
GGTGAGGTGA TGGCTATCGG CCGTACTTTT ATCGAAGCCC TCCAAAAAGC TTGTCAGAGC
TTAGAGATCA GCCGCGCAGG TCTGGGTGCA GATGGTAAGC ACAAAAGAAA CATCGACGAA
ATCATGTATG GCCTGGAGCA TGCCAGCTGG AACAGGTTGT TCCTGATCAA GGATGCCATG
AGTATGGGTG TGCCACTTGA GTCGATCCGT AAAGTTACCC GTATCGACAA ATGGTTCCTG
TCACAAATAC AGGAATTGGT GCAGCTGGAA ACAGAGCTGA AACGTTATTC ACTAAATAAT
ATTCCTAAAG ACTTCTTCTT TACCCTAAAG CAGAAAGGTT TTTCTGATAT CCAGATTGCC
TATCTGCTGG GCAATGTAAC CGAAGATGAA GTATACGAGC GCAGAAAGTC GTTGGGCATC
AAACGCGTTT ACAAAATGGT GGATACCTGT GCTGCCGAGT TTGCTGCAAA AACACCTTAC
TATTACTCAA CTTTTGAGGA AGAGAACGAA TCTTTGCCAT CTGATAAAAA GAAAGTGATT
GTACTTGGCT CTGGTCCTAA CCGTATCGGG CAGGGTATAG AGTTCGACTA TTCTTGTGTG
CATGGTCTGC TTGCCGCCAA AGAAACCGGT TTCGAAGCCA TCATGATCAA CTGTAACCCA
GAGACGGTAT CTACCGACTT TAACATGGCC GATAAACTAT ACTTTGAACC GGTATTCTGG
GAGCATGTAC GAGAAATTAT AGAACTGGAA AACCCTGTAG GTGTAATTGT ACAGCTAGGC
GGACAAACGG CCTTAAAGAT GGCCGAGAAG CTGCACGAGA ATGGCATTAA GATTATCGGC
ACTTCTTATA ACGACATGGA TGTGGCCGAA GACAGGGGCC GTTTCTCAGA CCTTTTAAAG
GAACTTGATA TTCCCTATCC AAAATATGGT GTTGCAGAAA ATGCCGAAGA AGCAATAGTT
GTAGCAAATG AAGTGGGTTA TCCGGTATTG GTCAGACCAA GCTATGTATT GGGCGGACAG
GGAATGAGCA TCGTGATCAA CGATGAAGAC CTGGAAAAAG CGGTCGTGAA ATTATTGGGA
GACCTTCCCG GTAACCGTGT ATTGATCGAT CATTTCCTGG ATAGGGCAGA AGAAACAGAG
TCTGATTCGA TCAGTGATGG GGAAGATGTA CATATCGTTG GATTGATGGA ACATATTGAG
CCAGCTGGTA TCCACTCCGG AGATTCCAGT GCGGTATTGC CTCCGTTCAG CTTATCTGAA
AAGGTAATGA ACGACATGGA AACTTACTCT AAAAAGATTG CAAGAGCACT GAATGTCATC
GGCCTGTTGA ACATCCAGTT TGCAGTTAAA GATGAAAAAG TATATGTGAT TGAGGCCAAT
CCAAGGGCTT CCAGGACGGT TCCTTTCATT GCTAAAGCTT ATGATGTGCC TTACATCAAT
ATTGCCGCCA AGGTGATGCT GGGTGTAAAT AAACTGAAAG ACTTTACCAT CGAGCGCAAA
CTGAAAGGTT ATGCCATTAA AGAACCGGTA TTCTCCTTCA ACAAATTCCC TGAGGTAACT
AAAGAATTAG GCCCGGAAAT GAAATCGACC GGCGAGGCCA TCAGGTTCAT TAAAGACACC
GAAGATCCTT ACTTCAGAAA ATTGATTAAA GATAAATCGA TGTATTTGTC CAAGTAA
 
Protein sequence
MPKDTSIRSV LIIGSGPIII GQACEFDYSG SQAALSLKEE GIEVSIINSN PATIMTDKVI 
GDHVYLWPLT VDSIEQILQE RKIDAVLPTM GGQTALNLCI EASERGIWEK YGVKVIGVDV
AAIEKTENRE AFRQLMVDIG VGVATSKIAN SFLEGKEAAQ EIGYPLVIRP SYTLGGYGGG
FVHKKEEFDQ ALKRGLEASP THEVLVEQAV LGWKEYELEL LRDSNDNVII ICSIENFDPM
GIHTGDSITV APAMTLSDRC YQDMRNQAIR MMRAIGNFAG GCNVQFSVNP DNDEIIAIEI
NPRVSRSSAL ASKATGYPIA KIAAKLAIGY NLDELENQIT KTTSAYFEPT LDYVIVKVPR
WNFDKFKGAN MELGLQMKSV GEVMAIGRTF IEALQKACQS LEISRAGLGA DGKHKRNIDE
IMYGLEHASW NRLFLIKDAM SMGVPLESIR KVTRIDKWFL SQIQELVQLE TELKRYSLNN
IPKDFFFTLK QKGFSDIQIA YLLGNVTEDE VYERRKSLGI KRVYKMVDTC AAEFAAKTPY
YYSTFEEENE SLPSDKKKVI VLGSGPNRIG QGIEFDYSCV HGLLAAKETG FEAIMINCNP
ETVSTDFNMA DKLYFEPVFW EHVREIIELE NPVGVIVQLG GQTALKMAEK LHENGIKIIG
TSYNDMDVAE DRGRFSDLLK ELDIPYPKYG VAENAEEAIV VANEVGYPVL VRPSYVLGGQ
GMSIVINDED LEKAVVKLLG DLPGNRVLID HFLDRAEETE SDSISDGEDV HIVGLMEHIE
PAGIHSGDSS AVLPPFSLSE KVMNDMETYS KKIARALNVI GLLNIQFAVK DEKVYVIEAN
PRASRTVPFI AKAYDVPYIN IAAKVMLGVN KLKDFTIERK LKGYAIKEPV FSFNKFPEVT
KELGPEMKST GEAIRFIKDT EDPYFRKLIK DKSMYLSK
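
As a rough illustration of how the gene and protein sequences relate, the following sketch translates the first 60 bases of the gene sequence and computes a GC fraction. It is a minimal pure-Python example, not the database's own method: the names `translate`, `gc_fraction`, and `FIRST_60` are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base grouping."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record.
FIRST_60 = "ATGCCTAAAG ACACCTCCAT ACGCTCAGTA CTAATTATCG GATCGGGCCC TATCATCATT"
print(translate(FIRST_60))  # MPKDTSIRSVLIIGSGPIII
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.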