Gene Caul_3986 details


Gene Information

Locus tag: Caul_3986
Symbol:
ID: 5901448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4315946
End bp: 4317880
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 67%
IMG OID: 641564507
Product: peptidase M61 domain-containing protein
Protein accession: YP_001685609
Protein GI: 167647946
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
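
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4315946, 4317880)
print(length)                     # 1935
print(protein_length_aa(length))  # 644
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.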


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAACGC TGTTCTTTAC CGCCTCCGCC CTCGCCCTTT CCCTCAGCTT CAGCAGCGCC 
TTCGCCCAGG AGCGCGTTCC GGTCCAGGCC CCGCCGACCC CGCCGATCGC CGCCGCCCAG
GACATCGCCT ATCCCGGCGT GCTGAAGCTG TCGGTCGACG CCACCGACCT GGACCGCAAG
ATCTTCCAGG TTCGCGAGAC GATTCCCGTG GCCAAGGCCG GCCCGATGAC CATCCTGTAT
CCGCAGTGGG TTCCCGGCGG CCACTCGCCG CGCAACGACC TGGACAAGAT GGCCGGGCTG
GTGATCACCG CCGGCGGCAA GACCCTGGCC TGGACCCGCG ACCCGGTCGC CGTCCACGCC
TTCCACTTCG ACGTGCCGGC CGGCGCGACC GAGATCCAGG TCAGCTTCCA GTTCCTGACC
CCGGTGAAGG CCGACGTCGG CCGGATCCTG GTGACCGACG ACATGCTGAA CGTGCAGTGG
CTGCAACTGG GCTTCTACCC GGCCGGCTAC TACACGCGCC GCATCCAGAT CGAGCCGACC
GTCAAGCTGC CGGAGGGATG GGGCTTCGGC ACCGCGCTCG AGAAGGCCTC GACCAACGGG
CAGAGCACCA CCTTCAAGAC CACCACCTTC GAGACCCTGG TCGACTCGCC GATGTTCGCC
GGCCGCTACT ACAAGCAGGT GGACCTGGAC CCCGGCGCGG CCACGCCGGT GCGCCTGAAC
ATCGTCGCCG ACAAGCCCGA GCTGCTGGAA ATCAAGCCCG AGGCCCTGCA GATCCACCGC
AACCTGGTGC AGCAGGCCTA CAAGCTGTAT GGCGCGCACC ACTACGATCA CTACGACTTC
CTGCTGGCCC TGACCGACAA GATGGGCGGC ATCGGGCTTG AGCATCACCG CTCCAGCGAG
AACGGCGTCA CCCCCAAATA CTTCACCGAC TGGGAAAAGA CCTTCGTCGG CCGCGACCTG
CTGGCCCACG AATACACCCA CTCGTGGAAC GGCAAGTTCC GCCGCGCGGC CGACCTCTAC
ACCCCGACGC TGAACGAGCC GATGCGCGAC AGCCTGATGT GGGTCTATGA GGGCCAGACC
CAGTACTGGG GCAATGTGCT GGCCTCGCGC TCGGGCCTGC AGACCAAGCA GCAGGGCCTG
GACAGCCTGG CCATGACCGC CGCCCTCTAC GACACCCGGG CCGGCCGCAA CTGGCGCAAC
GTGCTGGACA CCACCAACGA CCCGATCATC GCCAACCGCA AGCCGGCCTC GTGGACCAGC
TGGCAGCGCA GCGAGGACTA CTATTCGGAA GGCCAGCTGG TCTGGCTCGA CGCCGACACC
CTGATCCGCG AGAAGACCGG CGGCAAGAAG TCGCTGGACG ACTTCGCCAA GGCCTTCTTC
GGTGTCGAGA ACGGCTCGTA CGTGCCGCTG ACCTACGACT TCGACACGGT CGTGAAGACC
CTGAACGGCG TCGTGGAGAA CGACTGGGCC ACCTTCCTGA AGACCCGCAT CGAGGGCCTG
TCCGAGCACG CCCCGCTCGA TGGCCTGACG CGCGGCGGCT ACAAGCTAGT CTATACCGAC
ACGCCCACCG AGTTCTTCAA GGCGGCCGAG ACGCGCGGCA AGATCGTCAA TCTCAGCTAC
TCGCTGGGGA TCACGATCGG CAAGGACGGC CTGCTGTCGG CGGTCAACTG GGACACCCCG
GCCTTCAAGG CGGGCCTGAC GGCCGGCGAG ACCATCGTCG CGGTCAACGG CACCGCCTAT
GGCGACGACC TGATCAAGGA CGCGGTCAAG GCCACGGCCA AGGCCGACGC CCCGGTGGTC
GAACTGCTGG TCAAGGACGG CGAGCGCTAT CGCACCGTCA AGATCGACTA CCACGGCGGC
CTGAAGTACC CGCGCCTGGA GCGGATCGAG GGCACGCCGG CGCGGCTGGA CGAGATCTAC
ACGGCGCGCA AGTAG
 
Protein sequence
MKTLFFTASA LALSLSFSSA FAQERVPVQA PPTPPIAAAQ DIAYPGVLKL SVDATDLDRK 
IFQVRETIPV AKAGPMTILY PQWVPGGHSP RNDLDKMAGL VITAGGKTLA WTRDPVAVHA
FHFDVPAGAT EIQVSFQFLT PVKADVGRIL VTDDMLNVQW LQLGFYPAGY YTRRIQIEPT
VKLPEGWGFG TALEKASTNG QSTTFKTTTF ETLVDSPMFA GRYYKQVDLD PGAATPVRLN
IVADKPELLE IKPEALQIHR NLVQQAYKLY GAHHYDHYDF LLALTDKMGG IGLEHHRSSE
NGVTPKYFTD WEKTFVGRDL LAHEYTHSWN GKFRRAADLY TPTLNEPMRD SLMWVYEGQT
QYWGNVLASR SGLQTKQQGL DSLAMTAALY DTRAGRNWRN VLDTTNDPII ANRKPASWTS
WQRSEDYYSE GQLVWLDADT LIREKTGGKK SLDDFAKAFF GVENGSYVPL TYDFDTVVKT
LNGVVENDWA TFLKTRIEGL SEHAPLDGLT RGGYKLVYTD TPTEFFKAAE TRGKIVNLSY
SLGITIGKDG LLSAVNWDTP AFKAGLTAGE TIVAVNGTAY GDDLIKDAVK ATAKADAPVV
ELLVKDGERY RTVKIDYHGG LKYPRLERIE GTPARLDEIY TARK
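
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, assuming NCBI translation table 11 (bacterial, archaeal and plant plastid code), in which the codon assignments match the standard code and an alternative initiation codon such as TTG is read as methionine when it begins the CDS. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any initiation codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence above.
print(translate_cds("TTGAAAACGC TGTTCTTTAC CGCCTCCGCC"))  # MKTLFFTASA
print(translate_cds("ACGGCGCGCA AGTAG"))                  # TARK
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.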