Gene Caul_4644 details


Gene Information

Locus tag: Caul_4644
Symbol:
ID: 5902106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5021226
End bp: 5023175
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 69%
IMG OID: 641565163
Product: AsmA family protein
Protein accession: YP_001686262
Protein GI: 167648599
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
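
The listed gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the lengths reported in this record). The variable names are illustrative only.

```python
# Coordinates as listed in this record (assumed 1-based, inclusive).
start_bp = 5021226
end_bp = 5023175

# Inclusive coordinates: the length is the difference plus one.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is a stop
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1950 bp
print(protein_length, "aa")  # 649 aa
```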


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0752121
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCTTCG ACTGGAACTG GTTTCGCGGG CCGGTCGGCC GCATGGCTTC GGCGCGCCTG 
AACCGCCAGG TGGTGATCAC CGGTGATCTG CGCGTCCATC CCTGGTCGTT CTCGCCGAAG
GTCGAGGCCT ATGGCGTGCG CATCGCCCAG CCGGACTGGG CGGCCAAGGC CGACCCCAAG
GGTTCGGCGG CCGGGTCGCA GATGGCCAAG GTCGATCGCA TCGCCGTGCA GATCAGGATC
CTGCCGCTGC TGAAGGGCGA GGTGATCCTG CCGCTGCTGG CCATCGACCG TCCCGACGTG
CGCTTGCTGC GCGACGCCTC TGGCCGCGCC AACTGGACGT TCGGCGATCC CAACGCCCCC
AAGACGCCGC TGAAGCTGCC GGCGATCCAG CACTTCATCA TCAATGACGG CCAACTTCGC
TACGAAGACC AGCAGCGCGA CGTCTTCTTC CTGGGCTCGG TCAGCTCCAA CGAGCACGCC
ACCACCGATG GCCGCGGCAA GTTCGTGCTG GAGGGCAAGG GCCAGTTGAA CCGCGCGCCG
TTCACGGCCC TGGTCACCGG CGGTCCGCTG CTGAACATCT CGCCCAACCG CCCCTATCCG
TTCGACGCCC AGATCGACGC CGGCTCGACC CACGTGACGG CCAAGGGCGA CATCACCAAG
CCGTTCGACC TGGGCCGCTT CGAGACCCAG GTCAGCGTGT CGGGCACGGA CCTGAACCGG
CTCTACGCCC TGACCGGCCT GGCCCTGCCC AACACCCCGC CCTACAAGAT CAGCGGCCAC
CTGTCCCGCA AGGGCGAGCG GTTCGACTTC AACAAGCTGA CCGGCCGGGT CGGCGACAGC
GACGTCTCCG GCGACCTGTT CGTGCTGATG GGCAAGGCGC GGCCCTATCT GGAAGCCGAC
CTGCAGTCGC AGCGGCTGGA CTTCGATGAC CTGGGCAGCC TGGTCGGCGC CGCGCCGGGC
ACCGGCAAGG GCGAGACCGC CTCGGCCGGC CAGAAGGTCG AGGCCGGCAA ACGCGACGCC
ACCCAGCGCC TGCTGCCCGA CGCCACCCTG CAGGTCGAGC GCGTCAAGGC CATGGACGCC
AAGGTCAAGT ACCGCGCCCT GGCGGTGAAC GCCCCGAACC TGCCGCTGAA GAAGGTGCGT
GCGGAGCTGA CGCTGGAAAA GGGCGTGCTG ACCCTGGATC CGATCGCCTT CACCTTCTCG
CGCGGCGATC TGACCGGCAA GATACGGCTG GACGCCAATC CGGCGACCCC GCGCACCGAT
CTGGACCTGC GCCTGACCAA CGCCAAGCTG GAGGACTTCA TCCCGATCCA GAGCGGCGGC
AAGCCGGCGA TCGAGGGCGG GGTGATGGCC CGGGCCAAGC TGACCGGCTA CGGCAATTCC
GTCCACCGCG CCGCCTCGAC CGCCAATGGA CAAGTGACGC TGGTGTCGCC CAAGGGCGAA
ATCCGCCAGG CCTTCGCCGA ACTGCTGGGG GTCAACGCCT CCAAGGGCTT GATCCTGCTG
CTCAGCAAGG ACAACCGTGA AACCAGCGTC CGCTGCGCCG TGGCCGACTT CACCGTCAAG
GACGGGGTCA TGCGCGCCAA CCAGATCGTC GCCGACACCG GCGTGGTGCT GGCCAAGGGC
AAGGGCACGA TCGACCTGCG GAGCGAGCGT CTGGACCTGA AGATCAACGG CGACAGCAAG
AAGCCGCGGC TGGTCCGCCT GTTCATTCCG ATCACCATCA AGGGTCCGTT CCTGGCGCCG
AAGGTGGGGC TGGACACCGG CGGAGCCTTG GCTCAGGGCG GCGCCGCCGT CGCGCTCGGC
GCGCTGCTCT CGCCCTTGGC CGCCATCCTG CCCTTCGTCA CCGGCGGCGA AGCCAAGGAC
GCCGACTGCG CCAGCCTGGT CGCCGAGGCC CGCGGCGACG GCGCTCCGGT CAAGGTGGCC
CAGACCACCC CGGCCAAGGT GAAGAAGTAG
 
Protein sequence
MVFDWNWFRG PVGRMASARL NRQVVITGDL RVHPWSFSPK VEAYGVRIAQ PDWAAKADPK 
GSAAGSQMAK VDRIAVQIRI LPLLKGEVIL PLLAIDRPDV RLLRDASGRA NWTFGDPNAP
KTPLKLPAIQ HFIINDGQLR YEDQQRDVFF LGSVSSNEHA TTDGRGKFVL EGKGQLNRAP
FTALVTGGPL LNISPNRPYP FDAQIDAGST HVTAKGDITK PFDLGRFETQ VSVSGTDLNR
LYALTGLALP NTPPYKISGH LSRKGERFDF NKLTGRVGDS DVSGDLFVLM GKARPYLEAD
LQSQRLDFDD LGSLVGAAPG TGKGETASAG QKVEAGKRDA TQRLLPDATL QVERVKAMDA
KVKYRALAVN APNLPLKKVR AELTLEKGVL TLDPIAFTFS RGDLTGKIRL DANPATPRTD
LDLRLTNAKL EDFIPIQSGG KPAIEGGVMA RAKLTGYGNS VHRAASTANG QVTLVSPKGE
IRQAFAELLG VNASKGLILL LSKDNRETSV RCAVADFTVK DGVMRANQIV ADTGVVLAKG
KGTIDLRSER LDLKINGDSK KPRLVRLFIP ITIKGPFLAP KVGLDTGGAL AQGGAAVALG
ALLSPLAAIL PFVTGGEAKD ADCASLVAEA RGDGAPVKVA QTTPAKVKK
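
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of how the block-formatted sequences above could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the start-codon set is a simplification that covers only the most common bacterial start codons.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same codon-to-amino-acid assignments as the standard code; it differs
# in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplification (assumption): only the most common start codons are listed.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # A recognised start codon in first position is read as methionine.
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_blocks = "GTGGTCTTCG ACTGGAACTG GTTTCGCGGG"
print(translate(first_blocks))  # MVFDWNWFRG
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the protein sequence and the GC content listed in this record.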