Gene Caul_4272 details

Gene Information

Locus tag: Caul_4272
Symbol:
ID: 5901733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4642819
End bp: 4644477
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 68%
IMG OID: 641564791
Product: peptidase M28
Protein accession: YP_001685891
Protein GI: 167648228
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
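
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (an assumption that fits the stated gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4642819
end_bp = 4644477
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
coding_codons = codons - 1

print(span == gene_length_bp, remainder == 0, coding_codons == protein_length_aa)
```

If any of the three comparisons is false for another record, the coordinate convention or the stop-codon assumption is the first thing to re-examine.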


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.43547
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTCT CCCGTCGCGC CGCCCTCGCC GGCGTCGCCC TGCTGTTGAC CGTCCAACCT 
GTTTTCGCCG CGCCCGTCGC CAAGGCCAAA CCCGCCGCCA CCAAGCCAGC CGCCAGCTTC
ACGCCCTCAC CGGCCGCTAT CAAGGCGCAT ATGAGCTTCC TGGCCGACGA CCTGCTGGAA
GGCCGCGAGT CCGGCACGCG CGGCTACGAC ATCGCCGCCA ACTACGTGGC TTCGCAATAT
GCGGTGATGG GCGTCAAGCC GGCCGGCGAC AACGGCTCCT ACCTGCAGCA GGTGCCGCTG
ACGGCCTATC GCTCGGTCAA CGAGGGCGGG GTGTCCTACA CCACCGCCGA CGGCAAGTCC
GGCGCCCTGA CCTTCGGCGA GGACTACCTG CCGTCCGCCC AAGCCCGCCA GGCCGACACC
TCGGTGACCG CCCCGCTGGT CTTCGTCGGC TACGGGATCA ACGCCCCCGA GCGCGGTCGC
GACGACTATG CCGGCCTGGA CGTGAAGGGC AAGATCGTCG TCCTGCTGAC CGGCGCGCCC
AGCGGCTTCC AGACCGAGAT CCGGGCCCAC TACAGCAACA CCAACGTCAA GCGGGCCGAG
GCGGCCAGGC GCGGCGCGGT CGGGGTCATC ACCCTGCCGA CCACCAGCTC CGAGAAGCGC
CGCCCATTCG AGCGCGGCGT AGCCAACTAC CAGGAATGGA AGATGACCTG GAGCGACGCC
CGGGGCGTGG CCTATGTGCG CGGCGCCGAG GCGCCGGGCC TGGCCACCCT GAGCCTGAAG
GGCGGCGCCA AACTGTTCAC CGGCGCCCCG GTCACCCTGG AAAGCGTGCT GGCCGAGGCC
GAGACGCCGG AAGGTCTGGT CAAGGGCTTC GACCTGCCTG TCAACGTGAC GATCCAGCTG
AAGACCGAGA TCGAGAAGCG CCGGAGCAGC AACGTGGTCG GCCTGATCGA AGGCTCGGAC
CCGACCCTGA AGGCCCAGAC CATCATCCTC AGCGCCCACC TCGACCACCT GGGCATTCAC
GGCAAGGACG CCGACAAGAT CAACAACGGC GCGCTGGACA ACGCCTCGGG CGTCGCGACG
ATGCTGGAGG TGGCCCGAGG CTTCAAGGAA GCCAAGACCA AGCCCAAGCG CTCGATCGTC
CTGCTGGCGG TCACGGCCGA GGAAAAGGGC CTGATCGGCT CGGAATATTT CGCCAACAAC
CCGACCGTGC CGAAGGCCGG CATCGCCGCC GACGTCAATC TGGACATGCC GATCCTGCTG
TACGACTTCC AGGACGTGAT CGCCTTCGGC GCCGACCGCT CGTCGATCGG CCCGGCCGTG
GCCCGCGCCG CTGGCCGCGT CGGCATCGGC CTGTCGGCCG ACCCGCTGCC GGAAGAGGGC
CTGTTCACCC GCTCGGACCA CTATCGCTTC GTCGAGCAGG GCGTGCCGTC GGTGTTCCTG
ATGACCGGTT TCAAGAACGG CGGCGAAAAG GGCTTCAAGG ACTTCCTGGC CACGCATTAC
CACAAGCCCA ACGACGACCT GAACCAGCCG ATCAACTACG AGGCCGGCGC CCGCTTCGCC
CTGGTCAATT ACGAGATCGC CCGCGAGCTG GCCGACATGC CGGCCCGTCC GAGCTGGAAC
AAGGGCGACT TCTTCGGGAC GCTGTTCGGG AAGAAGTAG
 
Protein sequence
MPFSRRAALA GVALLLTVQP VFAAPVAKAK PAATKPAASF TPSPAAIKAH MSFLADDLLE 
GRESGTRGYD IAANYVASQY AVMGVKPAGD NGSYLQQVPL TAYRSVNEGG VSYTTADGKS
GALTFGEDYL PSAQARQADT SVTAPLVFVG YGINAPERGR DDYAGLDVKG KIVVLLTGAP
SGFQTEIRAH YSNTNVKRAE AARRGAVGVI TLPTTSSEKR RPFERGVANY QEWKMTWSDA
RGVAYVRGAE APGLATLSLK GGAKLFTGAP VTLESVLAEA ETPEGLVKGF DLPVNVTIQL
KTEIEKRRSS NVVGLIEGSD PTLKAQTIIL SAHLDHLGIH GKDADKINNG ALDNASGVAT
MLEVARGFKE AKTKPKRSIV LLAVTAEEKG LIGSEYFANN PTVPKAGIAA DVNLDMPILL
YDFQDVIAFG ADRSSIGPAV ARAAGRVGIG LSADPLPEEG LFTRSDHYRF VEQGVPSVFL
MTGFKNGGEK GFKDFLATHY HKPNDDLNQP INYEAGARFA LVNYEIAREL ADMPARPSWN
KGDFFGTLFG KK
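
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch in Python of one way to do that and to compute a GC fraction for comparison with the GC content reported in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGTTCT CCCGTCGCGC CGCCCTCGCC GGCGTCGCCC TGCTGTTGAC CGTCCAACCT"
dna = clean_sequence(first_row)

print(len(dna), dna[:3])
print(round(gc_fraction(dna), 2))
```

Pasting the full gene sequence in place of `first_row` gives a length and GC fraction that can be compared against the Gene Length and GC content fields.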