Gene Caul_1450 details


Gene Information

Locus tag: Caul_1450
Symbol:
ID: 5898905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1541929
End bp: 1544922
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 71%
IMG OID: 641561937
Product: peptidoglycan binding domain-containing protein
Protein accession: YP_001683078
Protein GI: 167645415
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
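
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated in the record), and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1541929
END_BP = 1544922
GENE_LENGTH_BP = 2994
PROTEIN_LENGTH_AA = 997


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the values listed in the record.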


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0762725
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCG CGGCGCCATG GAGCGTTAAG GGGATAGACC CCAAGGCACG GGAGGTCGCG 
AAAGACCTCG CGCGTCGCTC CGGCATGACG CTGGGCGAGT GGCTCAATCG CATGATCATC
GAGGGCGAGG GCGTCGATGT CGCCGCCCTG GAAGCCTTCG GCGAACCCAG CCGTCCATCC
CTCGCCGTCA ACAACGAACG CCCCAATACT TCCTACTATG AGACCGCTCG GGGCGCCGCG
CCGTCGCGTA TCGAAGTGCG CGAGCATCCG GCCGACGAGG TCGGTCGCGT CGCCATCGCC
CTGGACCGGC TGACCGACCG CATCGAGTCC GCCGAGAAGC GCTCGGCCCA GGCCATTTCC
GGAATCGACG AATCCGTGCG CGGCGCCTTG CAGCGGCTGG CCACGGCCGA GCGCGAGCAG
GTCGCCGTCG CCGCCCGCTT CGAGGGCGCG GTCGACGAGA TCAAGACCGA GCAGATCCGC
GGCGCCGAGC GCCTGCGCCG CATCGAGAAC GAGGCCGCCG GCCCGCGCTC GGCCGAGGCG
CTGCGGGCTC TGGAAGGCGC GCTCGGCAAG GTCGCCGGCC ATCTCTATGA AGGCGAAGCC
CGCACCCGCG AGACGATCGC TCATCTCGAA CAGCGTCTCG ACAATCAAGC CGGCGTGGGC
GCGGGCGATC CGTCGGCCCT GGTCGACGAA GTCGTGGCCC GACTGGGTGA GCGTCTGGAA
GCCGCCGAAT CCCGCACCGC CGAGGCGCTG CAGGCCCTGG GCGCCTCTTT CACCGCCCTG
GACGGCCGCC TGCGGACCGT CGAGACCAGC AATCCTGGCG AAGGCGTGCA GAAGCGCCTG
GACGAGCTGT CCAACGGCCT GACCCAGCGC ATGGAGGCCG CCCGCATGGA AATGGCGGCC
AAGCTGCGCC AGTCCGCCGA CGGTCGCTTC GACCGCATGG AGCTCAAGCT CGGCGAAATG
ACCGCCCACG TGCAGGCCGC CGAGCAGCGC TCGGCCCAGG CCATCGAGCG CATGGGCCGG
GAAGTGGTCG GCATGGCCGA CGCCCTGAAC CGCCGCGTCC AGACCTCGGA AGCCCGCAAC
ACCGCCGCCA TCGAGCAAGT CGCCGCCAGC ATCGACCAGA AGCTGAACCG CGCCGACAGC
GTTCAGGCCC AGGCTCTCGA GAAGCTGGGC ACGGAAATCG CCCGGATCAC CGAGAAGCTG
GCCGAGCGGA TCAGCAACGC CGAGCGCCGT AACGCCCTGG CCATCGACGA CGTCGGCGAC
CAGGTCACCC GCGTCACCGA TCGCCTGAAC CAGCGTCACG AGCGCACCTC GCAGGAACTG
GTCGACCGCA TCCGCCAGAG CGAGGAGCGC ACCGCGCGCA TGCTCGACGA AGCGCGCGAG
AAGATCGACG CCCGGCTTGC CGAAGCTCAA CGCAAGCTGG CCGAACAGGC CGCCGCGACC
GCGGCCCCAG TCACGCGGTC CGACCCGTCG CCGTTCGACG GCGAGCGCTA TTCGTTCGGG
GGCGAGGCCG TCCCCGAGGA CGTGTTCGAC CAACCCGCCG CCTTCATCCC GGCTCATGCA
TCGACGCCGG CCCGCACCGC CGCGCCCGCC GCCTTCGAAG CCCCGGTCTA CGAGGCCCCG
ACCTTCCCGG CCTCCGAGCC CGCCGAGCCC GAGTTCGGCG AAGAGGACTT CGAGGCCGCC
GACGGCTTCG TCTCCGCGCC CGAGCCGGAG ACGGCGTTCG AGCCGGACGC CGACTACGAT
ACCGATTTCG CGGCCCCCGA CTTCGCAGCC TCCGAGTTCT CCGCCCCCGA AGAGCCGGCC
GCGCCGTCGC GTCCACTGTC GACGCGCGAA GTGATCGAGC AGGCCCGGGC CGCCGCCCGC
CAAGCCGCCT TGGCCGGCGA GACCAAGGGC AAGGACAAGC CGAAGACCAA GAAGGTCGGA
ACCTCGCTGT TCTCCGGCTT CGGCGCCAAG AACGCCGAGA AGAAGGCCAA GAAGGGCGCT
ATGAAGACCG CCCTGATGGT CTCGGCCACC GCCGCCTTCA TGGGTGTCGG CGCGGCCGGC
GTGATCATGA GCATGGGTCA CGGCGGCGGA CCGCTGCCGG AGCGTGTCGC TCAGGGCACG
GCCAACAAGG CGACCGGCGA AATCCGCGCC ATCGAAACCG ACACCAGCCC GGCCGGCGCC
CGCGCCGCCG TCGCCCTGAC CTCGCAGGTC CCGCTCGACG CCACCGCCCT GCCGGTCGAG
GCGTCCACCG TCACGCCGCC CTCCGAGAAC GCCCAGGCCC TCTACGACGA CGGCGTGCGC
CGCATCGAGG CCAAGGACCG CACCGGCCTG GAGCCGATGC GCAAGGCCGC CAACCTCGGC
CTGCCGCGCG CCCAGTTCTA CCTGGCCAAG ATGTACGAAG TCGGCGAGGG CGGGGTGAAG
AAGGACCTCG TTGAAGCCCG CCGCTGGACC GAGCGCGCCG CCACGGCCGG CGAAGCCCGC
GCCATGCACA ATCTGGGCCT CTACTACTAC AAGGGCGACG GCGGCGAACG GAACTCGACC
AAGGCCGCCA GCTGGTTCCG CAAGGCCGCG GACCTGGGTC TGGTCGACAG CCAGTTCAAC
CTGGCTCAGC TCTACGAATT CGGGCGCGGC GCCGACCAGA ATCCGACCGA AGCCTACAAA
TGGTACCTGA TCGCCGCCAA GAACGGCGAC ACCAGCGCCG GCGCTCGGGC CATGGCCCTG
CGCAGCCAAC TGACCCCCGA GGGACAGCGG ACGGCCGAAC GCTCGGCCTA TGGCTTCCGC
TCGCAGGCCG CCGCGGCGCC GCTGCAGACC GCCAGCGTCT CGGCCGCCAG CGCCGGCCTG
GCCACCGCCC AAAAGGCCCT GTCCAGGCTC GGCTACTACC AGGGACCGCA GGACGGCGTC
GCCTCTCCGG CCTTGCGGAT GGCCATCGCG GCTTACCAGC GTGATCAGAG TCTGCCGACC
TCGGGAGCGC TCGATGGCGA GACCCTCAGC CGCCTGGCGA CCTGGGTCCG CTAG
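
The record reports a GC content of 71% for this gene. As a rough sketch of how that figure could be checked against the sequence as printed (10-base blocks separated by spaces and line breaks), the hypothetical helper below strips whitespace and counts G and C bases. Only the first line of the gene sequence is used here for brevity; the full sequence would need to be pasted in to evaluate the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, kept in its 10-base blocks.
first_line = (
    "ATGACGGCCG CGGCGCCATG GAGCGTTAAG GGGATAGACC CCAAGGCACG GGAGGTCGCG"
)

if __name__ == "__main__":
    print(f"GC fraction of first line: {gc_fraction(first_line):.1%}")
```

A single line is not expected to match the gene-wide percentage exactly, since GC content varies along the sequence.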
 
Protein sequence
MTAAAPWSVK GIDPKAREVA KDLARRSGMT LGEWLNRMII EGEGVDVAAL EAFGEPSRPS 
LAVNNERPNT SYYETARGAA PSRIEVREHP ADEVGRVAIA LDRLTDRIES AEKRSAQAIS
GIDESVRGAL QRLATAEREQ VAVAARFEGA VDEIKTEQIR GAERLRRIEN EAAGPRSAEA
LRALEGALGK VAGHLYEGEA RTRETIAHLE QRLDNQAGVG AGDPSALVDE VVARLGERLE
AAESRTAEAL QALGASFTAL DGRLRTVETS NPGEGVQKRL DELSNGLTQR MEAARMEMAA
KLRQSADGRF DRMELKLGEM TAHVQAAEQR SAQAIERMGR EVVGMADALN RRVQTSEARN
TAAIEQVAAS IDQKLNRADS VQAQALEKLG TEIARITEKL AERISNAERR NALAIDDVGD
QVTRVTDRLN QRHERTSQEL VDRIRQSEER TARMLDEARE KIDARLAEAQ RKLAEQAAAT
AAPVTRSDPS PFDGERYSFG GEAVPEDVFD QPAAFIPAHA STPARTAAPA AFEAPVYEAP
TFPASEPAEP EFGEEDFEAA DGFVSAPEPE TAFEPDADYD TDFAAPDFAA SEFSAPEEPA
APSRPLSTRE VIEQARAAAR QAALAGETKG KDKPKTKKVG TSLFSGFGAK NAEKKAKKGA
MKTALMVSAT AAFMGVGAAG VIMSMGHGGG PLPERVAQGT ANKATGEIRA IETDTSPAGA
RAAVALTSQV PLDATALPVE ASTVTPPSEN AQALYDDGVR RIEAKDRTGL EPMRKAANLG
LPRAQFYLAK MYEVGEGGVK KDLVEARRWT ERAATAGEAR AMHNLGLYYY KGDGGERNST
KAASWFRKAA DLGLVDSQFN LAQLYEFGRG ADQNPTEAYK WYLIAAKNGD TSAGARAMAL
RSQLTPEGQR TAERSAYGFR SQAAAAPLQT ASVSAASAGL ATAQKALSRL GYYQGPQDGV
ASPALRMAIA AYQRDQSLPT SGALDGETLS RLATWVR