Gene Caul_2201 details


Gene Information

Locus tag: Caul_2201
Symbol:
ID: 5899656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2398254
End bp: 2399510
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 68%
IMG OID: 641562693
Product: hypothetical protein
Protein accession: YP_001683827
Protein GI: 167646164
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
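
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2398254, 2399510)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields listed for Caul_2201.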


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.449085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACTCA ACGACGATGA TCGCGCCGTT CTGCGCCATC TGACAGACCA GGGCGCCTTG 
ATCATCGATC GAGCAGTCGC CTGGTGCGCG GTCAACTCCG GGAGCCGACA TCTGGCCGGC
CTGGAGCGTC AGCGGCAAGT TTTGCTCGAC GCCTTCGGAA ACCTGCCCGC CGCTCCGGCC
GATATCCCTT TGAGCGCGTC GCCCGAGATC GGCGCTGACG GCAAGGTCGG CGAGCAGCAC
CATCCCGCGG CCATCGCCGT GGTCGTGCGT CCCGAGGCCC CCGTCCAGGT CGTGCTGACC
GGCCATTACG ACACCGTCTA TCCCGAGACC AGCGCCTTCC GAGCCGTCGC CACACGTCCC
GACGGCGCCC TGCACGGACC GGGAATCGCC GACATGAAGG GCGGAATTTC GGTGATGTTG
GCGGCGCTGG AAGCCTTCGA ACGCCATCCG CTGGCTTCCC GCGTCGGCTA CAGGGTGCTG
CTGTCGCCGG ATGAGGAGAT CGGGTCGGTC GCCTCCGCGC CGATCCTGGC TGAGTTCGCG
CGCCATGGTC ATGTCGGACT GACCTACGAG CCTGCCCTGG CCGACGGTTC CCTGGCCAGC
GCCCGCAAGG GCTCGGGCAA TTTCCACATC GTGGTCCACG GCCGCGCCGC CCATGCCGGC
CGCGACTTCG CCGCCGGTCG CAACGCCGTG ATGGAGGCCG CCCGCATCGC CCAGGCCCTG
CACGCCCTGA ACGGCCTGCG CGAGGGCGTC ACCTGCAACA TCGCCAAGAT CGACGGCGGC
TCGCCGCTCA ACATGGTGCC CGACGTCGCC GTGGTCCGGT TCAACGTCCG CTTCCCGGCA
GCCGCCGACG CGGCCTGGTT CGAGGACGAG GTCAACAAGA TCGTCGCCAA TACGGGCGAG
GGGCTTCACG CCCATCTCCA CGGCCGCATC ACCCGCGGCG CCAAGCCGTT CAATATGGCT
CAGCAGAGGC TGTTTGGGGC GGTGAAGGAG GCCGGCGCGC TACTTGGTCA GGACATCGGC
TGGAACCCAT CTGGCGGGGT CTGCGAGGGC AACAACCTGT TCGCCTCGGG CCTTCCCAAT
GTCGACACCC TCGGCGTGCG TGGCGGCGAC ATCCATAGCG AGGCCGAGCA CGCCTGGCCC
GACAGCTTCG TTGAGCGCGC CCAGCTGTCG GCCGTGATCC TGATGAAGCT GGCCAGCGGC
GAGATCGACG CTCCCGCCCT GCGCGCCGCC ATGACCGACC TCGCGGAGAC CGTTTAA
 
Protein sequence
MRLNDDDRAV LRHLTDQGAL IIDRAVAWCA VNSGSRHLAG LERQRQVLLD AFGNLPAAPA 
DIPLSASPEI GADGKVGEQH HPAAIAVVVR PEAPVQVVLT GHYDTVYPET SAFRAVATRP
DGALHGPGIA DMKGGISVML AALEAFERHP LASRVGYRVL LSPDEEIGSV ASAPILAEFA
RHGHVGLTYE PALADGSLAS ARKGSGNFHI VVHGRAAHAG RDFAAGRNAV MEAARIAQAL
HALNGLREGV TCNIAKIDGG SPLNMVPDVA VVRFNVRFPA AADAAWFEDE VNKIVANTGE
GLHAHLHGRI TRGAKPFNMA QQRLFGAVKE AGALLGQDIG WNPSGGVCEG NNLFASGLPN
VDTLGVRGGD IHSEAEHAWP DSFVERAQLS AVILMKLASG EIDAPALRAA MTDLAETV
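
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). As a simplification, the sketch does not treat alternative start codons specially, so a gene starting with GTG or TTG would have its first residue rendered as V or L rather than M. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in NCBI ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAGACTCA ACGACGATGA TCGCGCCGTT CTGCGCCATC TGACAGACCA GGGCGCCTTG"
print(translate(first_line))  # MRLNDDDRAVLRHLTDQGAL
```

The output matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.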