Gene Caul_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3944 
Symbol 
ID5901406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4270301 
End bp4271701 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content67% 
IMG OID641564465 
Productglutamate--putrescine ligase 
Protein accessionYP_001685567 
Protein GI167647904 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0174] Glutamine synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.244314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.971464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACA AAGACAAGAA ATCCAAACCC GCCCCGAAGA CCACGCGCGG GGTGGCCTCG 
CTGGACGAGG CCAAGGACTG GTTCGCCCGG CAGAACATCG AGGAGATCGA GTGCGTCGTG
CCCGACCTGG CCGGGGTGCC GCGCGGCAAG ATCATGCCGG TGCGCAAGTT CCTGGGCACG
CCGTCGATGA ACCTGCCGCT GGCGGTGTTC TACCAGACCA TCACCGGCGA CTTCCCCGAG
TTCGAGGGAG CGGTCAATTC GGTGCAGGCC GACACCGACG TGATCCTGAC CCCGGACCTG
GCCACCCTGG CCGTGGTGCC CTGGGCCCAG GACCCGACGG CCCAGGTGAT CCATGACGCC
TTTCACCCCG ACGGCCGCCC CGTGGAGGAG AGCCCCCGCC AGGTGCTGCG CCGGGTGGTG
GCCCTCTATC GCGAGAAGGG CTGGGACCCG ATCGTCGCGC CGGAGATCGA GTTCTACCTG
GTCGAGAAGA ACACCGATCC CGACTATCCG CTGAAGCCGC CCGTGGGCCG CTCGGGCCGG
CCCGAGACCG GTCGCCAGGG CTATTCGATC GCGGCGGTCA ACGAATTCGA CGCCCTGTTC
GAGGACATGT ACGAATATTC CGAACGCCAG GGCCTGGAGA TCGACACCCT GATCCACGAG
AGCGGCGTGG CCCAGATGGA GATCAACCTG CGCCACGGCG CGCCGCTGGA GCTGGCCGAC
CAGGTGTTCA TGTTCAAGCG CACCATCCGC GAGGTGGCGC TGGAGCACGA GATCTACGCC
ACCTTCATGG CCAAGCCGAT GGCCGCCGAG CCCGGCAGCG CCATGCACAT CCACCAGTCG
ATCCTGAACA GCGACGGCGA GAACCTGTTC TCCGATCCCA AGACCGGCGA CGCCACCCCG
CTGTTCTACG CCTTCATCGC CGGCCAGCAG CGCTACCTGC CGGCGATCAT GGCGATCCTG
GCGCCTTATG TGAACTCCTA CCGCCGGATC GCGCGGGATT CGGGCGCGCC GGTCAACACC
CAGTGGGGCT ACGACAATCG CACCTGCGGC CTGCGGGTCC CGCCGTCCGA TCCGGAGAAC
CGGCGCCTCG AGAACCGCAT TCCGTCGTCG GACGCCAATC CGTACCTGGC CATCGCCGCG
GTGCTGGCCG CCGGCTATCT GGGCATGGTC CACAACCTGA CGCCCACCGC GCCCGTCGAC
ACCGACGCCA ACATCCGGGG CATCGAACTG CCGCGCAGCC TGCTGGAGTC GGTGGCCCTG
TTCGAGGAGG CCAAGCCGCT GATCGAGATC CTGGGCGGCA CGTTCTGCGC CGCCTACGCC
ACGGTGAAAC AGGCCGAGTA CGAGACCTTC ATGCGCACGA TCAGCCCGTG GGAGCGGGAG
TTCCTGTTGC TGAATGTGTA G
 
Protein sequence
MKYKDKKSKP APKTTRGVAS LDEAKDWFAR QNIEEIECVV PDLAGVPRGK IMPVRKFLGT 
PSMNLPLAVF YQTITGDFPE FEGAVNSVQA DTDVILTPDL ATLAVVPWAQ DPTAQVIHDA
FHPDGRPVEE SPRQVLRRVV ALYREKGWDP IVAPEIEFYL VEKNTDPDYP LKPPVGRSGR
PETGRQGYSI AAVNEFDALF EDMYEYSERQ GLEIDTLIHE SGVAQMEINL RHGAPLELAD
QVFMFKRTIR EVALEHEIYA TFMAKPMAAE PGSAMHIHQS ILNSDGENLF SDPKTGDATP
LFYAFIAGQQ RYLPAIMAIL APYVNSYRRI ARDSGAPVNT QWGYDNRTCG LRVPPSDPEN
RRLENRIPSS DANPYLAIAA VLAAGYLGMV HNLTPTAPVD TDANIRGIEL PRSLLESVAL
FEEAKPLIEI LGGTFCAAYA TVKQAEYETF MRTISPWERE FLLLNV