Gene Caul_1274 details


Gene Information

Locus tag: Caul_1274
Symbol:
ID: 5898729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1335247
End bp: 1336359
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 71%
IMG OID: 641561759
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_001682902
Protein GI: 167645239
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
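
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1335247
end_bp = 1336359

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1113
print(protein_length)  # 370
```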


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00848687
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCTCGA CCCTGCGTCC CGTCGTCCTG TTGCTCGGCT GCGTGATGGC CACATCGGCC 
TGTTCAGACC CCGCGCCCCC CCAGGAAAAG CCGGCCGCGT CGAAGCCGGC GGCGGCTCCA
GCCCCGGTTG CGCCGCCCCT GGTCGCCGCG TCGCCGGCTC CCGCGCCGGC CTCCCCCTCG
CCGATGGCCC AGGCGGTCAA CGCCGCCGCC TTCGATCCCG CCGCCACGAC GCCGGAGGCC
AAGCAAGCCT ATCTGACCCG CGCCCAGGTT CTGCTCGACC GCGCCCACTT CTCGCCCGGC
GTCATCGACG GCCAGGAGGG CTCGAACCTG ACGCTGGCGC TCAGCGCCTT CCAGGAGGCC
AACCGCCTGA CCGTCGACGG CAAGCTCAGC CCCGCCGTCT GGGACGCCCT GGCGGCTGAC
AGCGCGCCGG CCCTGACAGA CTATCTCATC ACACCAGAAG ACGTCGCCGG CCCGTTCACG
CCCGACATTC CCAAGGACGA CTACGAGGCG ATGGCCAAGC TGCCAGCGCT GGGTTACGGC
ACGCCGCTGG AGGCCCTGGC CGAGAAGTTC CACATGGACG AGCCGCTGCT GCAGGCCCTG
AACCCCGGCG TCGACTTCTC CAAGGCCGGG ACCACCATCA TCGTCGCCGC GCTTGGTCCC
GAGGGTTTGA GCGCCGAGGG CCTGGACGGC AAGGTCACCC GCATCGAGAT CGACAACGCC
AGGGGCGTGC TCAAGGCCTA TGCCGACGGC GACAAGCTGC TGGCGGTCTA TCCGGCCACG
GTGGGCAGCA CCGAGCGCCC GGCCCCGGTC GGCGAGTGGG CGGTCAACAC CGTCGCGCCG
CGCCCGACCT ACACCTATGA CCCCACGCGC CTGACGTTCG GCAAGCCGAC CGGCAAGCTG
ACCCTCAAGG CCGGGCCGAA CAATCCGGTG GGCTCCACCT GGATCGACCT GACCAAGGAC
ACCTACGGCA TTCACGGCAC GCCCGATCCG CGCCTCGTGA ACAAGCGCGC CTCGCACGGC
TGCGTGCGGC TGACCAACTG GGACGCGGCC GAGCTGGGCA AGGCGGTGGT GAAGGGGGCC
AAGGTGGTGT TCGAGGGCAA GCCGGTGCGG TGA
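
A GC content figure such as the one in the Gene Information fields is the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation; the function name `gc_content` is hypothetical, and how the database itself computes or rounds the value is not stated in this record. Spaces and line breaks in the pasted sequence are ignored.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence above: 6 of 10 are G or C.
print(gc_content("ATGCCCTCGA"))  # 60.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.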
 
Protein sequence
MPSTLRPVVL LLGCVMATSA CSDPAPPQEK PAASKPAAAP APVAPPLVAA SPAPAPASPS 
PMAQAVNAAA FDPAATTPEA KQAYLTRAQV LLDRAHFSPG VIDGQEGSNL TLALSAFQEA
NRLTVDGKLS PAVWDALAAD SAPALTDYLI TPEDVAGPFT PDIPKDDYEA MAKLPALGYG
TPLEALAEKF HMDEPLLQAL NPGVDFSKAG TTIIVAALGP EGLSAEGLDG KVTRIEIDNA
RGVLKAYADG DKLLAVYPAT VGSTERPAPV GEWAVNTVAP RPTYTYDPTR LTFGKPTGKL
TLKAGPNNPV GSTWIDLTKD TYGIHGTPDP RLVNKRASHG CVRLTNWDAA ELGKAVVKGA
KVVFEGKPVR
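
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard compact amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCCTCGA CCCTGCGTCC CGTCGTCCTG"))  # MPSTLRPVVL
# Last line of the gene sequence, which is in frame and ends in the TGA stop.
print(translate("AAGGTGGTGT TCGAGGGCAA GCCGGTGCGG TGA"))  # KVVFEGKPVR
```

Both outputs match the first and last ten residues of the protein sequence listed above.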