Gene Dshi_2639 details


Gene Information

Locus tag: Dshi_2639
Symbol: carB
ID: 5713537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2794515
End bp: 2797832
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 66%
IMG OID: 641268563
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_001533973
Protein GI: 159045179
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
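
The length fields in this record follow from the coordinates by simple arithmetic. The Python sketch below shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 2794515
END_BP = 2797832


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3318
print(protein_length(length_bp))  # 1105
```

Both values agree with the Gene Length and Protein Length fields of the record.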


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.239677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAGA GAACCGATAT CAAGTCGATC ATGATCATCG GAGCCGGGCC CATCATCATC 
GGGCAGGCCT GCGAGTTCGA CTATTCCGGG GCCCAGGCCT GCAAGGCGCT GCGCGAAGAG
GGCTACCGGG TGATCCTGGT CAACTCCAAC CCCGCCACCA TCATGACCGA CCCGGGCCTG
GCCGACGCCA CTTATATCGA GCCGATCACC CCCGAGATCG TCGCCAAGAT CATCGAGAAG
GAGCGCCCGG ACGCGCTCTT GCCGACCATG GGCGGTCAGA CCGGGCTGAA CACCTCGCTC
GCGCTCGAAG AGATGGGCGT GCTGGCGAAA TACGGGGTCG AGATGATCGG CGCCAAGCGC
GAAGCCATCG AGATGGCCGA GGACCGCAAG CTTTTCCGCG AGGCGATGGA TCGCCTCGGC
ATCGAGAACC CCCGTGCCAC CATCGCCACG ACCATGGACG AATGCATGGC TGCGCTGGAC
GATATTGGCC TGCCGGCGAT CATCCGGCCC GCCTTCACCC TCGGCGGGAC CGGGGGCGGC
GTGGCCTACA ACCGGGACGA TTACGAGCAT TTCTGCAAAT CCGGGCTCGA TGCCTCGCCG
GTCAACCAGA TCCTGATCGA CGAGAGCCTG CTGGGCTGGA AAGAGTTCGA GATGGAGGTG
GTCCGCGACA AGGCGGACAA TGCGATCATC GTCTGCGCCA TCGAGAACGT CGATCCGATG
GGCGTGCATA CGGGCGATTC GATCACCGTG GCCCCGGCGC TGACCCTGAC CGACAAGGAA
TACCAGATCA TGCGCAACGG CTCGATCGCC GTGCTGCGCG AGATCGGGGT GGAAACCGGC
GGCTCCAACG TGCAGTGGGC GGTCAACCCC GCCGACGGGC GCATGGTGGT GATCGAGATG
AACCCGCGGG TGTCGCGGTC CTCGGCGCTG GCCTCCAAGG CCACGGGCTT TCCCATCGCC
AAGATCGCCG CGAAACTTGC CGTGGGCTAT ACCCTGGACG AGCTCGACAA CGACATCACC
AAGGTCACGC CCGCCAGCTT CGAGCCGACC ATCGACTATG TCGTGACCAA GATCCCGCGC
TTCGCGTTCG AAAAATTCCC CGGCGCCGAG CCGAACCTGA CCACGGCGAT GAAATCCGTG
GGCGAGGCCA TGTCCATCGG CCGCACCTTC CACGAAAGCG TGCAGAAGGC GCTCGCCTCG
ATGGAGACCG GGCTGACCGG CTTCGACGAG ATCGCCATCC CCGGGATTTC TGCGGACCAC
AGGTCGGACG CCCCCGACAC CGCCGCCGTG GTCAAGGCAC TGGCCAGGCA GACGCCCGAC
CGGCTGCGCG TGATTGCCCA GGCCATGCGC CACGGGCTGA GCGATGACGA GATTCAGGCC
GCGACCTCAT ACGATCCGTG GTTCCTCGCG CGCATCCGCG AGATCGTCGA GACCGAGGCA
CAGGTGCGCC GCGACGGCCT GCCGCTGGAG GCCGAGGGCC TGCGCAAGCT CAAGATGATG
GGCTTCACCG ATGCGCGGCT GGCCAAGCTG ACGGGCCGGG ACGAGGGCCA GGTCCGCCGC
GCCCGCACCC GGCTCGGTGT GACCGCGCAG TTCAAGCGGA TCGACACCTG CGCGGCCGAG
TTCGAGGCCC AGACGCCTTA TATGTACTCG ACCTACGAAA CCCCGGTGAT GGGCGAGGCC
GAATGCGAAT CGCGGCCCAC GGACGCCACC AAGGTGGTCA TCCTCGGTGG CGGGCCGAAC
CGGATCGGCC AGGGGATCGA GTTCGACTAC TGCTGCTGCC ATGCGTGTTT CGCGCTGACC
GAGGCGGGCT ACGAGACCAT CATGGTCAAC TGCAACCCCG AGACCGTTTC GACCGATTAC
GACACCTCGG ACCGGCTCTA TTTCGAGCCG CTGACCTTCG AGCATGTGAT GGAGATCCTG
CGCGCCGAAC AGGAGAACGG CACGCTGCAC GGGGTGATCG TCCAGTTCGG CGGCCAGACC
CCCCTGAAGC TCGCCAATGC GCTGGAGGCC GAAGGCATCC CGATCCTCGG GACCACGCCC
GATGCCATCG ATCTGGCCGA GGATCGCGAG CGGTTCCAGG CGCTGGTCAA CGACCTGGGC
CTCAAACAGC CTCACAACGC CATCGCCAGC ACCGATGCCG AGGCCTTCGC CGCCGCGGGC
GACATCGGCT TCCCGCTGGT CATCCGGCCG TCCTATGTTC TGGGCGGGCG CGCGATGGAG
ATCGTGCGCG ACATGGGTCA GTTGGAGCGG TATATCGCCG AGGCGGTGGT GGTCTCGGGC
GACAGCCCGG TACTGCTCGA CAGCTATCTC GCCGGCGCGG TGGAGTTGGA CGTGGACGCG
CTCTGCGATG GGGAGAACGT TCATGTGGCG GGCATCATGC AGCATATCGA AGAGGCGGGC
GTCCATTCCG GCGACAGCGC CTGTTCCCTG CCGCCCTATT CGCTCTCCGA CGACGTGCTG
GCGCGCATTC GCGTGCAGAC CGAGGCACTG GCGCGCGCCC TGCGGGTCAA GGGCCTGATG
AACGTGCAAT TCGCGATCAA GGATGACGAG ATCTACCTGA TCGAGGTGAA CCCGCGCGCC
TCGCGCACCG TGCCCTTCGT GGCCAAGGCC ACCGACAGCG CAATCGCGTC CATCGCCGCC
CGGCTGATGG CGGGCGAGCC CCTGTCGAAT TTCCCGCTGC GCGACCCCCT GCCCCATGAC
GCACCCGAGG ACCAGCACCT GCCGATCGGC GACCCGATGA CGCTGGCGCA CCCGGATACG
CCCTGGTTCT CGGTCAAGGA GGCGGTTCTG CCCTTCGCGC GCTTCCCGGG CGTCGACACG
ATCCTCGGCC CGGAAATGCG CTCCACCGGG GAGGTGATGG GCTGGGACCG GTCCTTCCCC
CGCGCCTTCC TGAAGGCACA GATGGGCGCG GGCACCGTAC TGCCGACGGA AGGCACGGTG
TTCCTGTCGA TCAAGGAAGC CGACAAGACC GAGATGCTGG TGGAAACGGC GGCGATGCTG
ACCGAACTGG GCCTCGACAT CGTCGCGACC AGAGGGACCG CGGCCTTCCT GAAGGATCAC
GGCATCGCCT CCAAGGTCGT CAACAAGGTC TACGAGGGCC GCCCGGACGT GGTCGACATG
CTCAAGGACG GGCGCATCGC GCTGGTGATG AACACCACCG AGGGCGCGCA GGCGGTCAAT
GACAGCCGCG AAATCCGGTC CGTCGCGCTC TATGACCGCA TCCCCTACTT CACCACGCTG
GCGGCCAGCC ATGCCGCGGC CCAGGCCATG ATCGCCCGCC GCGAGGGCGA GATCGGCGTC
CGCGCGTTGC AGGGATGA
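
The gene sequence is laid out in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. A minimal Python sketch of that step, together with a GC-fraction calculation, is given here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCCAAGA GAACCGATAT CAAGTCGATC ATGATCATCG GAGCCGGGCC CATCATCATC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content field of the record.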
 
Protein sequence
MPKRTDIKSI MIIGAGPIII GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPGL 
ADATYIEPIT PEIVAKIIEK ERPDALLPTM GGQTGLNTSL ALEEMGVLAK YGVEMIGAKR
EAIEMAEDRK LFREAMDRLG IENPRATIAT TMDECMAALD DIGLPAIIRP AFTLGGTGGG
VAYNRDDYEH FCKSGLDASP VNQILIDESL LGWKEFEMEV VRDKADNAII VCAIENVDPM
GVHTGDSITV APALTLTDKE YQIMRNGSIA VLREIGVETG GSNVQWAVNP ADGRMVVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDELDNDIT KVTPASFEPT IDYVVTKIPR
FAFEKFPGAE PNLTTAMKSV GEAMSIGRTF HESVQKALAS METGLTGFDE IAIPGISADH
RSDAPDTAAV VKALARQTPD RLRVIAQAMR HGLSDDEIQA ATSYDPWFLA RIREIVETEA
QVRRDGLPLE AEGLRKLKMM GFTDARLAKL TGRDEGQVRR ARTRLGVTAQ FKRIDTCAAE
FEAQTPYMYS TYETPVMGEA ECESRPTDAT KVVILGGGPN RIGQGIEFDY CCCHACFALT
EAGYETIMVN CNPETVSTDY DTSDRLYFEP LTFEHVMEIL RAEQENGTLH GVIVQFGGQT
PLKLANALEA EGIPILGTTP DAIDLAEDRE RFQALVNDLG LKQPHNAIAS TDAEAFAAAG
DIGFPLVIRP SYVLGGRAME IVRDMGQLER YIAEAVVVSG DSPVLLDSYL AGAVELDVDA
LCDGENVHVA GIMQHIEEAG VHSGDSACSL PPYSLSDDVL ARIRVQTEAL ARALRVKGLM
NVQFAIKDDE IYLIEVNPRA SRTVPFVAKA TDSAIASIAA RLMAGEPLSN FPLRDPLPHD
APEDQHLPIG DPMTLAHPDT PWFSVKEAVL PFARFPGVDT ILGPEMRSTG EVMGWDRSFP
RAFLKAQMGA GTVLPTEGTV FLSIKEADKT EMLVETAAML TELGLDIVAT RGTAAFLKDH
GIASKVVNKV YEGRPDVVDM LKDGRIALVM NTTEGAQAVN DSREIRSVAL YDRIPYFTTL
AASHAAAQAM IARREGEIGV RALQG
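
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this in plain Python. It assumes the gene sequence is given in coding orientation, which is consistent with it starting with ATG and ending with TGA. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may serve as start codons, so the standard codon table is used here. The `translate` helper is illustrative and does not handle ambiguous bases.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGCCCAAGA GAACCGATAT CAAGTCGATC"))  # MPKRTDIKSI
```

The output matches the first block of the protein sequence in the record.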