Gene Caul_3661 details

Gene Information

Locus tag: Caul_3661
Symbol:
ID: 5901116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3951107
End bp: 3952513
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 70%
IMG OID: 641564172
Product: peptidase M28
Protein accession: YP_001685286
Protein GI: 167647623
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
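
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3951107
end_bp = 3952513
gene_length_bp = 1407
protein_length_aa = 468

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches "Gene Length"
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches "Protein Length"
```

Under these assumptions all three checks agree with the listed values: 1407 bp is 469 codons, one of which is the stop codon.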


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.21215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGT TCCGCACTCT GCTGACCGCC GCCGCCGTTT CAGCCCTGAT GGCCGGGGCG 
TCTCACGCCC AGGACGCTCG CACCGCCGAG GCGTTGCGGG ACAAGGCCCT GCTGGACCGC
ACCGCCTGGG ATGTCACCGA GGACCTGACC ACCAGCATTG GCCCGCGCCT CGTCGGCTCG
CCGGCCATGG CCCGCGCCAA GGACTGGGGC GTGGCCAAGT TCAAGGCCCT GGGCTTCACC
AACGTCAAGG TCGAGGAATT CGCCAAGCCG TCGTGGACGC GGGGCGAGGA GAGCGCCGAA
CTGGTCGCGC CCTATCCGAT GAAGCTGAGC ATCGTGGGCC TGGGCCGCAC GGTCCCGACC
CCGGCCGCCG GCATCGAGGC CGAGGTCGCG CTGTTCAAGA CCTATGCCGA GCTGATCGCC
GCGCCAGAGA GCGCCGTGAA GGGCAAGATC GTGGTGATCA CCCAGCCGAT GGTCCGCGCC
CAGGACGGTG CCGGCTATGG CGTGGCCGGG ATCTCGCGCC GGTCCGGTCC AGTCGAGGCC
GCCAAGCGCG GCGCCGTCGC CCTGCTGATC CGTTCGGTCT CGACCTCCGA CTCCACCGTG
CCGCACACCG GCGTCACCGC CTTCGGCGAC GGCGTCGTCT CGATCCCCTC GGCCGCCCTG
GGCGTGCCGG AAGCCGAACA GTTGGAGCGC CTAGCCGCCA AGGGTCCGCT GCGCATCAAG
CTGAAGCTGG CGTCAACGAT CGACCCGGCC GACGTGGCCT GGAACATCTC GGGCGAGATC
AAGGGCTCGG AAAAGCCCGA CGAGGTGATC GTCATTGGCG GCCACCTGGA CAGCTGGGAC
GTCGGCACCG GCGCCCTGGA CGACGCCACG GGCATCGCCA TCACCACGGC CGCCGCCAAG
CTGATCGGCG ACCTGCCCAA GCATCCCAAG CGCACTATCC GCGTGGTGAT GTTCGGCTCG
GAAGAAAGCG GCGGCTCGTC GGAGGCCTAT CTGGCCGCCC ACAAGGACGA GGTGTCCAAG
ATCGTCCTGG CCGGCGAGAG CGACAGCGGG GCCGACCGTA TCTACAGCCT GCAGATCCCC
AAGGGTTCGG CGGGGCACCC GGCGATGCAG GCCGCCGCCC GCGTGCTGAC CCCGCTGAAG
ATCTATGTCG ACCGCGCTCC GCCGGCCCAC GCCGGCGCGG ACATCGAGGG ACTGGAAGAA
GCCGGCGTGC CGGTGATCGC CCTGAACCAG GACGCCAGTC GCTACTTCGA CTACCACCAC
ACCATGGACG ACACTCTCAA CAAGGTGCGC CCGGACGAAC TGGCCCAGAA CGTCGCGGCG
TGGGCCAGCT TCCTCTACCT GGTGGCCGAC AGCGACATCG ACTTCCGGAC GCTGAGCGCG
GCCGCTCCTG CGGCTCCGGC GCACTGA
 
Protein sequence
MRLFRTLLTA AAVSALMAGA SHAQDARTAE ALRDKALLDR TAWDVTEDLT TSIGPRLVGS 
PAMARAKDWG VAKFKALGFT NVKVEEFAKP SWTRGEESAE LVAPYPMKLS IVGLGRTVPT
PAAGIEAEVA LFKTYAELIA APESAVKGKI VVITQPMVRA QDGAGYGVAG ISRRSGPVEA
AKRGAVALLI RSVSTSDSTV PHTGVTAFGD GVVSIPSAAL GVPEAEQLER LAAKGPLRIK
LKLASTIDPA DVAWNISGEI KGSEKPDEVI VIGGHLDSWD VGTGALDDAT GIAITTAAAK
LIGDLPKHPK RTIRVVMFGS EESGGSSEAY LAAHKDEVSK IVLAGESDSG ADRIYSLQIP
KGSAGHPAMQ AAARVLTPLK IYVDRAPPAH AGADIEGLEE AGVPVIALNQ DASRYFDYHH
TMDDTLNKVR PDELAQNVAA WASFLYLVAD SDIDFRTLSA AAPAAPAH
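
The sequence blocks above are printed in space-separated groups of ten characters. The following is a minimal sketch of how such a block might be cleaned up, checked for GC content, and translated with translation table 11. The function names are hypothetical and not part of any database tool, and the code assumes the sequence contains only unambiguous bases (A, C, G, T). The codon-to-amino-acid assignments of table 11 are the same as those of the standard code, so a single lookup string is used.

```python
# Codon lookup for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above, used here as a short sample.
first_line = "ATGCGCCTGT TCCGCACTCT GCTGACCGCC GCCGCCGTTT CAGCCCTGAT GGCCGGGGCG"
dna = clean_sequence(first_line)
print(len(dna))
print(gc_fraction(dna))
print(translate(dna))
```

For the sample line, the translation reproduces the first twenty residues of the protein sequence listed above (MRLFRTLLTA AAVSALMAGA). Passing the full gene sequence through the same functions would allow the listed GC content and protein sequence to be compared in the same way.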