Gene Caul_3952 details

Gene Information

Locus tag: Caul_3952
Symbol:
ID: 5901414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4279190
End bp: 4280569
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 69%
IMG OID: 641564473
Product: glutamine synthetase catalytic region
Protein accession: YP_001685575
Protein GI: 167647912
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID:
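
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; neither is stated in the record, although both agree with the listed 1380 bp and 459 aa. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(4279190, 4280569)
print(length, protein_length_aa(length))  # 1380 459
```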


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0296195
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGG TCGCCGATCC CCAGGAATGC CGCGACTTCC TCGCCGCCCA TCCGCAGGTG 
AAGTACGTCG ACGTGTTCTT CACCAGCATG ACCGGCGTGC CCCGTGGGAA ACGCCTCAGG
ATCCACGAGC TGCAGGCGGT CTACGACTAT GGCCGCTTCC TGCCGGGCTC GATCCTGGTG
GTCGACACCA ACGGCGCCGA CTGCGAGGAG ACGGGCCTGG TCTGGGAGGA CGGCGACGCC
GACCGCCGGG CGCGGCCCGT GCCCGGAACC CTGACGCTCG CGCCCTGGCT GGGGCCGGAC
ATGGCCCAGG TGATGCTGTC GCTGTACGAG CTGGACGGCG CGGCCAACGA CCTGGATCCG
CGCCATGTGC TCAAGCGCGT GCTGGACCGC TTCGCCGCCG ACGGCCTGAC GCCGGTCGCG
GCCTGCGAGC TGGAATATTA CCTGGTCGAC CAGCAGCGCG GTCCGAACGG CGAGTTGCTG
CCGGCCCGGT CGCTGCAGAC CGGCGAGCGG CCCCATGGCA TTCAGGTCTA TGGCCTGCCG
GAGCTGGAGG CGATCTCGCC GTTCCTGCGC GAGCTGTGGG AGACCTGCGA CGTGCTGGGC
GTGCCGCTGG AGGGGGCGAT CTCGGAGTTC GCGCCGGGCC AGGTCGAGCT GACCCTCAAG
CACAAGCCCG ACGCCCTGGC CTGCGCCGAC GACGCCCTGC GCTACAAGCG GGCCGCCAAG
GGCGTGGCCC TGCGCCATGG ATGCGAGGCC ACCTTCATGG CCAAGCCCTG GGCCGACCAG
GCCGGCAACG GCTTCCACGT GCATGTCAGC TTCAACGACG CGGCGGGAAA CAACCTGTGC
GCCGCCGAGG ATCCGGAGGG CTCGGCGCTG CTCAAGCACG CGATCGGCGG CATGAAGGTG
CTGATGGCCG AGTGCATGGC CATCCTCGCG CCCAACGCCA ACAGCTATCG CCGTTTCAAG
GCCAACTCCT ACGCGCCCGT CGCCCCGACC TGGGGCGTCA ACAATCGCAC CGTATCCTTG
CGCGTGCCGG CCGGCCCGCC GCCGACCCGG CATGTGGAGC ACCGCGTGGC CGGCGCCGAC
GCCAATCCGT ACCTGGTGCT GGCCGTGCTG CTGGCCTGCG CCCACCACGG CATCGCCAAC
AAGATCGATC CGGGTCCAGC GGTGGTCGGC GACGGCTACG CGGCCGCGGC CAAGGAGAAG
AGCCGCCTGC CGACCGACTG GTATGCGGCC GTCAACCTGT TCGAAGCCTC CGACGTGCTG
CGCGACTATC TGGGCGCGCG GTTCGTGGAG ATGTTCGTCT CGGTCAAGCG CACCGAGCAG
GCGCGCTTCG CCGAGGTGGT CACGTCGCTG GATTATGACT GGTATCTGCG CAACGCGTGA
 
Protein sequence
MNMVADPQEC RDFLAAHPQV KYVDVFFTSM TGVPRGKRLR IHELQAVYDY GRFLPGSILV 
VDTNGADCEE TGLVWEDGDA DRRARPVPGT LTLAPWLGPD MAQVMLSLYE LDGAANDLDP
RHVLKRVLDR FAADGLTPVA ACELEYYLVD QQRGPNGELL PARSLQTGER PHGIQVYGLP
ELEAISPFLR ELWETCDVLG VPLEGAISEF APGQVELTLK HKPDALACAD DALRYKRAAK
GVALRHGCEA TFMAKPWADQ AGNGFHVHVS FNDAAGNNLC AAEDPEGSAL LKHAIGGMKV
LMAECMAILA PNANSYRRFK ANSYAPVAPT WGVNNRTVSL RVPAGPPPTR HVEHRVAGAD
ANPYLVLAVL LACAHHGIAN KIDPGPAVVG DGYAAAAKEK SRLPTDWYAA VNLFEASDVL
RDYLGARFVE MFVSVKRTEQ ARFAEVVTSL DYDWYLRNA
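
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be unblocked, translated, and checked for GC content. The helper names (`unblock`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; the special handling of alternative start codons in table 11 is not implemented, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def unblock(text: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = unblock(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = unblock(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAACATGG TCGCCGATCC CCAGGAATGC"))  # MNMVADPQEC
```

Passing the full gene sequence to `translate` and comparing the result with the unblocked protein sequence is one way to confirm that the two listings agree.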