Gene TM1040_0632 details


Gene Information

Locus tag: TM1040_0632
Symbol: carB
ID: 4076119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 672273
End bp: 675635
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 61%
IMG OID: 638005929
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_612627
Protein GI: 99080473
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
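
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 672273
END_BP = 675635


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("gene length (bp):", length_bp)
    print("protein length (aa):", protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.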


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.298922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.175677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGA GAACCGATAT CAAGTCGATC ATGATCATTG GCGCCGGGCC CATCGTCATC 
GGTCAGGCCT GCGAATTCGA CTATTCTGGC GCTCAGGCCT GCAAGGCGCT GCGCGAAGAA
GGCTACCGGG TGATCCTGGT GAACTCCAAC CCAGCGACGA TCATGACCGA CCCGGGTCTT
GCTGATGCGA CCTACATCGA GCCGATCACC CCGGAAGTGG TCGCCAAGAT CATCGAGAAA
GAACGCCCCG ACGCGCTCCT GCCGACGATG GGCGGGCAGA CCGGCCTCAA CACCTCGCTC
GCGCTTGAGG AAATGGGCGT CCTCGAAAAA TTCAACGTCG AGATGATCGG CGCCAAGCGC
GAAGCCATCG AGATGGCCGA AGACCGCAAG CTCTTCCGCG AGGCCATGGA TCGCCTCGGT
CTTGAAAACC CGCGCGCCAC CATTGTTACA GCGCCGAAAA AAGACAACGG CAACGCCGAC
CTTGAGGCGG GTGTCGCGCT TGCACTCGAA GCCCTCGAGG ACATCGGCCT GCCCGCAATC
ATCCGCCCCG CCTTTACCCT CGGTGGCACC GGTGGCGGCG TGGCCTACAA TCGCGAGGAT
TACATCCACT ATTGCCGCTC CGGCATGGAT GCCTCTCCGG TGAACCAGAT CCTCGTCGAT
GAGAGCCTGC TGGGCTGGAA AGAATACGAG ATGGAAGTGG TCCGCGACAA AGCGGATAAT
GCCATCATCG TCTGCTCGAT CGAAAACGTG GACCCGATGG GCGTTCATAC CGGGGACTCG
ATCACCGTGG CCCCTGCCCT TACGCTTACG GACAAGGAAT ATCAGATGAT GCGGACCGCC
TCGATTGCGG TCCTGCGCGA GATTGGCGTG GAAACCGGCG GCTCCAACGT ACAATGGGCA
GTGAACCCCG CAGACGGGCG GATGGTTGTC ATCGAGATGA ACCCGCGCGT GAGCCGCTCT
TCGGCGCTGG CCTCCAAAGC GACAGGTTTC CCGATTGCAA AGATCGCGGC AAAGCTTGCT
GTGGGCTACA CGCTCGACGA GTTGGACAAC GACATCACCA AGGTGACGCC TGCATCGTTT
GAGCCGACCA TCGACTATGT CGTCACCAAA ATTCCGAAAT TCGCGTTTGA GAAATTCCCT
GGTTCCGAGC CTTACCTCAC GACAGCGATG AAATCGGTGG GCGAAGCCAT GGCGATTGGC
CGCACCATCC ACGAATCGAT GCAAAAGGCG CTCGCTTCGA TGGAATCCGG TCTCACCGGC
TTTGACGAGG TGGAGATCCC CGGTGTGCAA GCTGGCCTTT GGGAAAGCGT TGGTGCCGAC
GACAAGGCCG CAGTGATCAA GGCGATCAGC CAGCAGACCC CGGACCGCCT GCGCACCATC
GCACAGGCGA TGCGTCATGG CTTGTCGGAC GACGAAATTC AGGGCGTCAC GAAATTCGAT
CCGTGGTTCC TCGCCCGGAT CCGCGAGATC ATCGACGCCG AGCGCGAGAT CCGCAAAAAT
GGCCTACCGA TGCGCGAAGA CAAGCTGCGC GCGCTCAAGA TGCTCGGCTT TTCGGACGCC
CGTCTGGGTC TGCTGACGGG CCGTGACGAG GACAACGTGC GCCGCGCGCG CCACAACCTC
GGCGTCAAGG CGGTGTTCAA ACGCATCGAC ACCTGCGCCG CAGAGTTCGA AGCGCAGACG
CCCTATATGT ACTCCACCTA TGAAAGCCCG ATGATGGGTG AAGTGGAATG CGAAGCGCGC
CCCTCGGATC GCAAAAAGGT CGTTATTCTT GGTGGCGGGC CAAACCGGAT CGGTCAGGGT
ATCGAGTTCG ACTACTGCTG CTGTCACGCC TGTTTTGCGC TGACGGATGC GGGGTATGAG
ACCATCATGG TCAACTGCAA CCCGGAAACA GTTTCGACCG ACTATGACAC CTCGGATCGC
CTCTATTTCG AGCCCCTCAC CATGGAGCAC GTCATGGAGA TCCTGCGCGT CGAACAGGAA
AACGGCACCC TGCACGGTGT GATTGTTCAG TTCGGTGGCC AAACCCCGCT GAAACTTGCC
AATGCGCTAG AGGCCGAAGG CATTCCGATC CTCGGCACCA CGCCGGACGC GATTGACCTT
GCCGAAGACC GTGAGCGCTT CCAGGCGCTT GTGAATGAGC TTGGCCTGAA ACAGCCCAAG
AACGGCATCG CTTCCACCGG CGAACAAGCG CTTAAGATCG CAGAGGAAAT CGGCTTCCCG
CTGGTGATCC GCCCGTCCTA CGTTCTGGGT GGTCGCGCGA TGGAAATCGT GCGCGACATG
GACCAGCTCA AACGCTACAT CAACGAGGCA GTGGTGGTAT CGGGCGACAG CCCGGTGCTC
TTGGACAGCT ACCTCTCTGG CGCGGTGGAG CTCGACGTGG ACGCGATCTG CGACGGCAAA
GACGTGCATG TTGCAGGCAT CATGCAGCAT ATCGAGGAAG CTGGCGTTCA CTCTGGTGAC
TCGGCGTGCT CGCTGCCGCC GTACTCGCTC GACAAAGAGG TGATCGAGCG TATCAAGGAG
CAGAGCTTTG CGCTTGCGAA GGCGCTGAAT GTTGTTGGTC TGATGAACGT GCAATTTGCG
ATCAAGGACA ATGAGATCTA CCTGATTGAG GTAAACCCGC GCGCCTCGCG CACGGTGCCG
TTTGTCGCCA AGGCCACCGA CAGCGCCATC GCCTCCATCG CCGCGCGCGT CATGGCCGGA
GAGCCGCTGT CGAACTTCCC GCAGCGCGCA CCCTACGAGC CCGACGCAGG CTATGACGTG
AACACACCCA TGGCTGATCC GATGACGCTT GCTGACCCGG ACATGCCGTG GTTCTCCGTC
AAAGAAGCGG TGCTGCCCTT TGCCCGTTTC CCCGGCGTCG ACACCATTCT GGGGCCGGAA
ATGCGCTCTA CCGGTGAAGT CATGGGCTGG GATCGCAGCT TTGCGCGTGC CTTCCTCAAG
GCGCAGATGG GCGCTGGCAT GGTGCTGCCC AGCAAAGGAC GCGCGTTCAT TTCGATCAAG
GATGAGGACA AGACCGAAGT CATGCTCGAT ACGGCGCGCA TCCTGATCGC CCAGGGCTTT
GATCTGGTGG CCACCCGCGG CACGCAGGGG TGGCTTGCGG GGCACGGTGT AGAGTGCGCT
GTGGTGAACA AGGTCTATGA AGGACGTCCG CATGTGGTGG ACATGCTCAA GGATGGCGAG
ATCCAGCTGG TGCTCAACAC CACCGAAGGC AACCAGGCGG TCGAGGATTC CAAGCCGATG
CGCTCTGTCG CGCTCTATGA CAAGATCCCC TATTTCACCA CCGCCGCCGG GGCCCATGCC
GCCGCGCGCG CCATTCAGGC GCAGGCCGAA GGGGAAGTCG AAGTGAAAAG CCTGCAAGGC
TAA
 
Protein sequence
MPKRTDIKSI MIIGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPGL 
ADATYIEPIT PEVVAKIIEK ERPDALLPTM GGQTGLNTSL ALEEMGVLEK FNVEMIGAKR
EAIEMAEDRK LFREAMDRLG LENPRATIVT APKKDNGNAD LEAGVALALE ALEDIGLPAI
IRPAFTLGGT GGGVAYNRED YIHYCRSGMD ASPVNQILVD ESLLGWKEYE MEVVRDKADN
AIIVCSIENV DPMGVHTGDS ITVAPALTLT DKEYQMMRTA SIAVLREIGV ETGGSNVQWA
VNPADGRMVV IEMNPRVSRS SALASKATGF PIAKIAAKLA VGYTLDELDN DITKVTPASF
EPTIDYVVTK IPKFAFEKFP GSEPYLTTAM KSVGEAMAIG RTIHESMQKA LASMESGLTG
FDEVEIPGVQ AGLWESVGAD DKAAVIKAIS QQTPDRLRTI AQAMRHGLSD DEIQGVTKFD
PWFLARIREI IDAEREIRKN GLPMREDKLR ALKMLGFSDA RLGLLTGRDE DNVRRARHNL
GVKAVFKRID TCAAEFEAQT PYMYSTYESP MMGEVECEAR PSDRKKVVIL GGGPNRIGQG
IEFDYCCCHA CFALTDAGYE TIMVNCNPET VSTDYDTSDR LYFEPLTMEH VMEILRVEQE
NGTLHGVIVQ FGGQTPLKLA NALEAEGIPI LGTTPDAIDL AEDRERFQAL VNELGLKQPK
NGIASTGEQA LKIAEEIGFP LVIRPSYVLG GRAMEIVRDM DQLKRYINEA VVVSGDSPVL
LDSYLSGAVE LDVDAICDGK DVHVAGIMQH IEEAGVHSGD SACSLPPYSL DKEVIERIKE
QSFALAKALN VVGLMNVQFA IKDNEIYLIE VNPRASRTVP FVAKATDSAI ASIAARVMAG
EPLSNFPQRA PYEPDAGYDV NTPMADPMTL ADPDMPWFSV KEAVLPFARF PGVDTILGPE
MRSTGEVMGW DRSFARAFLK AQMGAGMVLP SKGRAFISIK DEDKTEVMLD TARILIAQGF
DLVATRGTQG WLAGHGVECA VVNKVYEGRP HVVDMLKDGE IQLVLNTTEG NQAVEDSKPM
RSVALYDKIP YFTTAAGAHA AARAIQAQAE GEVEVKSLQG
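
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It uses the standard codon assignments, which translation table 11 shares for elongation codons. The alternative start codons permitted by table 11 are not handled, and any character other than A, C, G or T raises a `KeyError`. All function names are illustrative.

```python
# Codon table built from the standard base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    first_bases = "ATGCCGAAGA GAACCGATAT CAAGTCGATC"
    print(translate(first_bases))  # MPKRTDIKSI
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.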