Gene Caul_3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3878 
Symbol 
ID5901340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4198162 
End bp4201035 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content72% 
IMG OID641564400 
Product(glutamate--ammonia-ligase) adenylyltransferase 
Protein accessionYP_001685502 
Protein GI167647839 
COG category[O] Posttranslational modification, protein turnover, chaperones
[T] Signal transduction mechanisms 
COG ID[COG1391] Glutamine synthetase adenylyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCC TGCTTGAACG CCTCACCCCC TGCGGCCCCG TGGTCGATCC CAAGGCCGCC 
GAGCGGGCGC ACGAGGCGAT CGGGCGTAAG GTCGGCGCGG CCATGGCCCT GGTCGAGGCC
GCCTGGCCGG CCTTGGCCCC GGTGTTCGCC GCCTCGCCCT ACCTGGCCGG CCTGGCGCGG
CGCGACGGTC AGCGACTGCC GCTGATCCTC GACAGCGATC CCGATACACG GCTGGCCCAG
ATCCTCACCG CCGCCGAGGC CGTGGGCGCC GAACCCGACT TCGAAACCGC CCGACGGGTT
ATGCGTGAGC TGAAGGCCGA CCTGCACCTG CTGACCGCCC TGTGCGACCT GGGCGGGGTC
TGGGACCTGG ACGCGGTGAC CAGCGCCCTG ACCCGCTTCG CCGACGCCAC CCTGCACGCC
TCGCTGGCCC AGGCTGTGCG GCTGGAGGTG GCGCGCGGCG CCCTGACCCA CGTCGGCGAA
GGCCCGAGCG GCCCGGCGCC GGGCCTGTTC TGCATCGCCA TGGGCAAGCA CGGAGCCTTC
GAGCTCAACT ATTCCAGCGA CATCGACTTC TCGATCTTCT ACGCGCCCGA AGACCTGCCG
GTGGCGCAGG GCGTCGAGCC GCAGGGTGTG GCGGTGCGCA TCGCCAACCA CCTGGGCCGG
TTGCTGCAGG AGCGCACGGC CGATGGCTAT GTGTTCCGGA TCGACCTGCG CCTGCGCCCC
GACCCGTCCT CGACGCCGCC GGCCATGCCC ATCGACGCGG CGCTGGATTA CTACGAAAGC
GTCGGCCAGA ACTGGGAGCG GGCGGCCCAC ATCAAGGCGC GGATCGCGGC CGGCGACTTC
CCGCGCGGCG AAGCGTTCCT GGCCGAGTTG CAGCCGTTCA TCTGGCGCAA GAATCTCGAC
TTCGCGGCCA TCGCCGACAT CCACTCGATC AAGCGCCAGA TCCACGCCTA CAAGGTCGAC
GACCGCCTGA CCGCTCAGGG CGCGGACCTG AAGCTGGGGC GCGGCGGCAT CCGCGAGATC
GAGTTCTTCG TCCAGACCCA GCAACTGATC CTGGGCGGTC GCCATCCCGA CCTTCGCGGT
CCGCGCACGC TCGACGCCCT GGCCGCCCTG TCGGCCGCGG GTCACGTGAC GCCGGCGGAC
CGCGACTATC TGATCAAGGC CTATCGCGAC CTGCGCGCCC TGGAGCACCG GGCCCAGATG
ATCGCCGACG ACCAGACCCA CAAGCTCCCC GAATCCGACG CCGACCGCAA GAAGGTGGCG
GCGCTGTGGG GACATGGCAA CCTGCGCAGC TTTGACGCCT TGGTCGGCCA CATGCTCAAG
GGGGTGAACC GTCGCTACGG CGCGCTGTTC AAGGGCGAGG AAGAACTGTC CTCGCGGTTC
GGCAGCCTGG TGTTCACCGG GGTCGAGGAC GATCCGGAGA CCCTGGCCAC CCTCAAGCGC
ATGGGCTTTT CGCACCCCGA GCGGGTGGCC GCCACCATCC GCGGCTGGCA TCACGGTCGC
ATCGCCGCCA CCCGCACCGC GCGAGGGCGT GAGCTGTTCA CCCGCTTGGC CCCGCGCCTG
CTGGACGCCG CCAACGCCAC CGGCGCGCCG GACGACGCCT TCAACCGGTT CGGCGACTTC
TTCGCGGGGC TGAGCAGCGG CGTGCAGATC CAGTCGCTGT TCCTGGCCCA GCCGCGCCTG
TTCGAGCTGA TCGTCGAGGT CATGGCCTTC GCGCCGCGCT TGGCCAGCAC CCTGGCGCGG
CGGCCCACGG CGCTGGACGC CCTGCTGGAC CCGGCCTTCT TCGGGCCGAT GGAGATGCCC
AGGGACGCGC CGTGGGACCC GGCCGACTTC GAGGGGGCGA TGGACGCCGC GCGCCGCCTG
TTCCGCGACC AGAGCTTCCG GATCGGGGTG CGGGTGATCA GCGGCACGGC CGGGGCGCGC
GACGCCGGCG CCGCCTTCGC CGACCTGGCC GACCTGATCG TCCGCGGCCT GGCTCCGGCC
GCCCTGGCCG AGGTCGAGCG TCTGGGCGGC GCGTTCCCGG GCCAGGTCGC CGTGGTGGCC
CTGGGCAAGG CCGGGTCGCG CGAGATGACC GCCAAGTCTG ACCTGGACCT GATGACGCTG
TACGCCGCCG ACGATCCGGC CGGCATGTCG GCGATCAAGG GCTGGGGGGC CGAGAGCTTC
TACGCCCGGT TCACCCAGCG CCTGACCTCC GCGCTGTCGG CGCCGACCGG CGAGGGCACG
CTCTACGAGG TGGACCTGAA GCTGCGGCCG TCGGGGACCA AGGGGCCGGT GGCGGTCAGC
TTCGCGGCCT TCGAGGACTA TTACGACCGC GAGGCCGAGA CCTGGGAGTT GTTGGCCCTG
ACCCGGGCCC GGGTGATCTG GGCCTCGTCG CCGGATTTCC AGGCCCGGGC CGAAGGTGCG
ATCGCCGCCG CTCTGCGCCG GTCCCGCGAC CCGGCCAAGA CCGCCGCCGA CGTGCTGGAC
ATGCGCGACC TGATGGAGCG CGAGCGACCG GGCAAGGGGG ATTGGGACCT CAAGCTCACT
CCCGGCGGCC TGGTCGATAT CGAGTTCGCC GCCCAGTTCC TGCAACTGGT CCATGCGGGA
GGCGGTGGGC CGCTGGCCCA GAACACGGGC GAAGCCTTGG CGGCGCTGCG GGCCGCGGGC
CTGGGCGACA AGGCGGCCCT GACCGCTTTG GAGGCCGCGT GGCGGCTGGA GCAGGACCTG
TCGCAACTGC TGAAGGTGGC GCTGGAGAAC GGGCACGATC CCGACGCCGA ACCCAAGGCC
TTCCGGGCGC TGCTGGCCCG CGCGGGCGGC GTGCGGGAGT TCCGGGCGTT GAAGGGCAAG
CTGACCAAGG CCAAGGCCGA GGCGCGGGGG GCGTTCGAGC AGATCGTGCG GTAG
 
Protein sequence
MTRLLERLTP CGPVVDPKAA ERAHEAIGRK VGAAMALVEA AWPALAPVFA ASPYLAGLAR 
RDGQRLPLIL DSDPDTRLAQ ILTAAEAVGA EPDFETARRV MRELKADLHL LTALCDLGGV
WDLDAVTSAL TRFADATLHA SLAQAVRLEV ARGALTHVGE GPSGPAPGLF CIAMGKHGAF
ELNYSSDIDF SIFYAPEDLP VAQGVEPQGV AVRIANHLGR LLQERTADGY VFRIDLRLRP
DPSSTPPAMP IDAALDYYES VGQNWERAAH IKARIAAGDF PRGEAFLAEL QPFIWRKNLD
FAAIADIHSI KRQIHAYKVD DRLTAQGADL KLGRGGIREI EFFVQTQQLI LGGRHPDLRG
PRTLDALAAL SAAGHVTPAD RDYLIKAYRD LRALEHRAQM IADDQTHKLP ESDADRKKVA
ALWGHGNLRS FDALVGHMLK GVNRRYGALF KGEEELSSRF GSLVFTGVED DPETLATLKR
MGFSHPERVA ATIRGWHHGR IAATRTARGR ELFTRLAPRL LDAANATGAP DDAFNRFGDF
FAGLSSGVQI QSLFLAQPRL FELIVEVMAF APRLASTLAR RPTALDALLD PAFFGPMEMP
RDAPWDPADF EGAMDAARRL FRDQSFRIGV RVISGTAGAR DAGAAFADLA DLIVRGLAPA
ALAEVERLGG AFPGQVAVVA LGKAGSREMT AKSDLDLMTL YAADDPAGMS AIKGWGAESF
YARFTQRLTS ALSAPTGEGT LYEVDLKLRP SGTKGPVAVS FAAFEDYYDR EAETWELLAL
TRARVIWASS PDFQARAEGA IAAALRRSRD PAKTAADVLD MRDLMERERP GKGDWDLKLT
PGGLVDIEFA AQFLQLVHAG GGGPLAQNTG EALAALRAAG LGDKAALTAL EAAWRLEQDL
SQLLKVALEN GHDPDAEPKA FRALLARAGG VREFRALKGK LTKAKAEARG AFEQIVR