Gene Caul_3592 details

Gene Information

Locus tag: Caul_3592
Symbol:
ID: 5901047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3876652
End bp: 3878598
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 70%
IMG OID: 641564102
Product: peptidase M1 membrane alanine aminopeptidase
Protein accession: YP_001685217
Protein GI: 167647554
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
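
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3876652
END_BP = 3878598


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1947
print(expected_protein_length(length_bp))  # 648
```

Both results agree with the Gene Length (1947 bp) and Protein Length (648 aa) fields of the record.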


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.757001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTAG ACGGAACCGT CCTGCCCATG CCCTCTCTCA AGACCACCAC CCGCGCCGCC 
CTGCTGGCGC TCCTGCTCTG CGGCGCGGCC GCCGCGCCCG TCCTGGCGCA AACCGCTCAA
CCCCCGATCC CGGCCATCCT GATGACGCCC GAGGCCCGCG ACATCCACTC CTACGCCCAG
CCGCTGGTGG CCCGGGTCAC CCATGTCGAC CTGGACCTGA CCGCCGACTT CGCCGGCCAG
AAGATGACCG GCACGGCCGC CCTCGACATC GCCGCCGCGC CGGACGCCGA GGAGGTGGTG
CTCGACAGCA AGGGCCTGGT GATCCACGGC GTCACCGACG ACAAGGGCGC GGCCCTGCCG
TGGACCCTGG GCAAGGCTGA CCCGATCCTG GGCGCGCCGC TGACGGTGCA GCTGCCCAAG
GGGGCCGGAG CCGCCAAGCG CATCGTCATC AGCTATGACA GCGCCCCCGG CGGCGCGGCC
CTGCAATGGC TGACCCCGGC CCAGACGGCG GGCAAGATCA AGCCCTATCT GTTCAGCCAG
GGCGAGGCGA TCCTCAACCG CACCTGGATC CCCACCCAGG ACAGCCCGGG CGTCCGCCAG
ACCTGGACCG CCCGCATCGT CGCGCCCGAG GGCCTCAAGG CCGTCATGAG CGCCGAGATG
CTGACCCCCA ACGGCGAGCC CGTCGCTGGC GGCCGCGCCT ATCGCTTCAA GATGGACAAG
CCAGTCGCCT CGTACCTGAT CGCCATCGCC ATCGGCGACA TCGCCTTCAC CCCGCTGGGC
CAGCGGACCG GCGTCTACAC CGAGCCGTCG GTGATGAAGA AGACCGCCAA CGAACTGGTC
GATGTCGAGA AGATGGTCGA GGCCGCCGAG AGCCTCTACG GCCCCTATGC CTGGGGCCGC
TACGACCTGC TGGTCCTGCC GCCGTCGTTC CCGTTCGGCG GCATGGAGAA CCCCCGCCTG
ACCTTCGCCA CGCCCACGAT CATCGCCGGC GACCGCTCGC TGGTCAGCCT GGTGGCGCAT
GAGCTGGCCC ACTCGTGGTC GGGCAACCTG GTGAACAACG CCACCTGGTC GGACTTCTGG
CTGAACGAGG GCTTCACCGA CTATTTCGAA AACCGGATCA TGGAGAAGCT CTACGGCAAG
GACCGCGCCG ACATGCTGGC CGATCTGGGC TGGAGCGACC TGCAGGGCGC GATCAAGGAC
GCCGGCGGGT TGAGCGGCGC CGACACCCGC CTGCACCTGG ACCTGACCGG CCGCGATCCC
GACGACGGCA TGACCGACAT CGCCTATCAG AAGGGCGCGA CCTTCCTGCG CACCATCGAA
AAGGCGGTCG GCCGCGCGCG CTGGGACGCC TATCTCAAGG CCTATTTCGC CCGGCACGCC
TTCCAGAGCC AGACCACGGC CGGCTTCGTG GCCGACCTGC GCGAGAACCT GATCAAGGGC
GACCCGAAGC TCGAAGCCGC GATCGGCATC GACAAGTGGG TCTATGACGT GGGGCTGCCG
GACAACGCCG TGCACATCCA TTCTGCGGCC TTCCCGGCGG TGGACGCCTT GGCCGCCGCC
TACGCCAAGG GCGGCCCCGC GCCGATCGCC AGGTGGAAGG CCTGGAGCAC GCCCGAGCGC
ACGCGCTTCA TCGCCAGCCT GCCCCGCGCC CTGCCGAAGG CGCGCCTGGC CGCGCTCGAC
AAGGCCTTCG GCCTGTCGGC CCAGGGCAAC AGCGAGATCC GCTTCGTCTG GCTGGAACTG
GCCGTCGCCA ACCGCTACGA CCCCGCCATG CCGTCTCTGC AGGCCTTCCT GACCGACCAG
GGCCGCCGCA AGTTCGTCGC CCCGCTGTTC AAGGACCTGA TGGCCCAGGG CGACTGGGGC
CAGCCGATCG CCAAGGCGCT CTACGCCAAG ACCCGGCCGC TCTATCACGC GGTCACGCGC
CAGACGGTCG ACGGGATCGT GAAATAG
 
Protein sequence
MSVDGTVLPM PSLKTTTRAA LLALLLCGAA AAPVLAQTAQ PPIPAILMTP EARDIHSYAQ 
PLVARVTHVD LDLTADFAGQ KMTGTAALDI AAAPDAEEVV LDSKGLVIHG VTDDKGAALP
WTLGKADPIL GAPLTVQLPK GAGAAKRIVI SYDSAPGGAA LQWLTPAQTA GKIKPYLFSQ
GEAILNRTWI PTQDSPGVRQ TWTARIVAPE GLKAVMSAEM LTPNGEPVAG GRAYRFKMDK
PVASYLIAIA IGDIAFTPLG QRTGVYTEPS VMKKTANELV DVEKMVEAAE SLYGPYAWGR
YDLLVLPPSF PFGGMENPRL TFATPTIIAG DRSLVSLVAH ELAHSWSGNL VNNATWSDFW
LNEGFTDYFE NRIMEKLYGK DRADMLADLG WSDLQGAIKD AGGLSGADTR LHLDLTGRDP
DDGMTDIAYQ KGATFLRTIE KAVGRARWDA YLKAYFARHA FQSQTTAGFV ADLRENLIKG
DPKLEAAIGI DKWVYDVGLP DNAVHIHSAA FPAVDALAAA YAKGGPAPIA RWKAWSTPER
TRFIASLPRA LPKARLAALD KAFGLSAQGN SEIRFVWLEL AVANRYDPAM PSLQAFLTDQ
GRRKFVAPLF KDLMAQGDWG QPIAKALYAK TRPLYHAVTR QTVDGIVK
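
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in which start codons are permitted). The following is a minimal sketch of how one might translate the gene sequence and compute its GC content. The helper names `clean`, `gc_content` and `translate` are hypothetical, and the code assumes the sequence is pasted in the space-separated blocks of ten shown in this record.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGTCCGTAG ACGGAACCGT CCTGCCCATG"
print(translate(prefix))  # MSVDGTVLPM
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content field of the record.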