Gene Caul_4763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4763 
Symbol 
ID5902225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5147620 
End bp5148843 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content66% 
IMG OID641565283 
Productargininosuccinate synthase 
Protein accessionYP_001686381 
Protein GI167648718 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.235186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA AGCCCGTGAA GAAGGTCGTG CTCGCCTATT CTGGCGGTCT CGACACCTCG 
ATCATCCTCA AGTGGCTGCA GACCGAGTAC GGGGCGGAGG TCGTCACCTT CACCGCCGAC
CTGGGCCAGG GCGAGGAAAT CGAGCCGGCG CGCGCCAAGG CGCTGGCGGC GGGCGTGAAG
CCGGAAAACA TCTTCATCGA GGACGTGCGC GAGGAGTTCG TGCGCGATTA CGTGTTCCCG
ATGTTCCGCG CCAACACGGT CTATGAGGGC CAGTACCTGC TGGGCACCTC GATCGCCCGT
CCGCTGATCG CCAAGAAGCA GATCGAGATC GCCCGCAAGG TCGGGGCCGA CGCGGTCAGC
CACGGCGCCA CCGGCAAGGG CAACGATCAG GTCCGCTTCG AACTGGGCTA CTACGCCCTC
GAGCCCGACA TCCACGTGAT CGCCCCCTGG CGCGAATGGG ACTTCAAGTC CCGCGAGGCC
CTGCTGGACT TCGCCGAGAA GCACCAGATC CAGATCGCCA AGGACAAGCG CGGCGAGGCG
CCGTTCAGTG TCGACGCCAA CCTGCTGCAC AGCTCGTCGG AGGGCAAGGT CCTGGAGGAT
CCGGCCGTCG AGGCCCCCGA GTTCGTCCAC ATGCGCACCA TCGCGCCGGA AGACGCGCCC
GACAAGCCGC ACATCTTCAC CCTCGATTTC GAGCGCGGCG ACGCGGTGGC CATCGACGGC
GTGGCCATGA GCCCGGCCAC GATCCTGACC AAGCTCAACG AACTGGGTCA CGACAACGGC
GTCGGTCGCC TGGACCTGGT CGAGAACCGC TTCGTCGGCA TGAAGTCGCG CGGCGTTTAC
GAGACCCCGG GCGGTACGAT CCTGCTGGCC GCCCACCGGG GCATCGAATC GATCACCCTG
GATCGCGGCT CGATGCACCT GAAGGACGAG CTGATGCCGA AATACGCATC GCTGGTCTAT
AACGGCTTCT GGTTCTCGCC CGAGCGCGAG ATGCTGCAGG CGGCCATCGA CTACAGCCAG
GCCAAGGTCG CCGGCCAGGT GCGCGTCAAG CTCTACAAGG GCAATGTCAG CATCATCGGT
CGCACCAGCC CCTACAGCCT CTACGACCAG GACCTGGTCA CCTTCGAGGA GGGCAAGGTC
GCCTACGATC ACCGCGACGC CGGCGGCTTC ATCAAGCTCA ACGCCCTGCG CCTGCGCGTG
CTGGCCAAGC GCGACAAGCG CTGA
 
Protein sequence
MANKPVKKVV LAYSGGLDTS IILKWLQTEY GAEVVTFTAD LGQGEEIEPA RAKALAAGVK 
PENIFIEDVR EEFVRDYVFP MFRANTVYEG QYLLGTSIAR PLIAKKQIEI ARKVGADAVS
HGATGKGNDQ VRFELGYYAL EPDIHVIAPW REWDFKSREA LLDFAEKHQI QIAKDKRGEA
PFSVDANLLH SSSEGKVLED PAVEAPEFVH MRTIAPEDAP DKPHIFTLDF ERGDAVAIDG
VAMSPATILT KLNELGHDNG VGRLDLVENR FVGMKSRGVY ETPGGTILLA AHRGIESITL
DRGSMHLKDE LMPKYASLVY NGFWFSPERE MLQAAIDYSQ AKVAGQVRVK LYKGNVSIIG
RTSPYSLYDQ DLVTFEEGKV AYDHRDAGGF IKLNALRLRV LAKRDKR