Gene Caul_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3224 
Symbol 
ID5900679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3484486 
End bp3485496 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content71% 
IMG OID641563729 
Producthypothetical protein 
Protein accessionYP_001684849 
Protein GI167647186 
COG category[S] Function unknown 
COG ID[COG4093] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0804136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATA ACGCCGCCGC TCTGTCCCGC AAGCCCCGGC GTCGCGGTCT GCTGGCGCCT 
TTCGTGGTGT TGGCATTAGT TGCTCTGGGG TGGAGCGCCG GCTGGGTGTG GCTGCGCGGC
CAGGCCGAGC AGCGAATGGA CGCCACCGCC CTGTCGCTGA AATCGCGCGG CTACGACCTG
TCGTGGGACG TCCGGACCTT CAGCGGCTAT CCGTTCCGCA TGGACGTGCG CCTGACCAAC
GCCCGGGTCG CCGAGCCGTC GGGCTGGGCG TTGCGGGCGC CGGAGCTGAC GGGCGAGGCC
ATGGCCTACG ACATCGGTCA CTGGGTGGTC GTCGCCCCGG CCGGCGTGGT CATGACCCGG
CCGATCAATG GCGACGTGGC GATCACCGGC CAGGCGCTGC GGGCCAGCTT CGCCGGCTTT
GACAAGTACC CGCCGCGCAT CTCGGTGGAG GGCGCGAACC TGATCTTCAC CACCGCCCCT
GGCGTCGCGC CCTTCCCGCT GCTGTCGACG GCGGGGCTGC AACTGCACAT CCGCCCCGGT
CCGGACGACC AGGGCGCGAT CTTCTTCGAG GCCAAGGGCG CCAAGGCCCG CTTCACCGGC
CTGATGGGCC GGATTGCCGA GGATCGCACC GCCGACCTGA TCTGGGATTC CAAGATCAGC
AAGGTCAGCG CCCTGCGCGG CAGAAACTGG GCCGACGCAG TGGGCGACTG GTCCAAGGCC
GGCGGAACCC TGACCGTGCA GCAGGGCAAG CTCAACGCCG GCGAGGCGCT GCTGGAAGCC
AAGTCCGGCG CCCTGACCGT CGGCGACGAC GGCCGCCTGC AGGGCGCGCT CGACGTCACC
GTGCGCGAGG TTCCCTCGCC CGGCGAGGCG CTGAAGAGCC CCGACGCCGC CGCGGCGGCC
GTCGCCCAGG CGCTCGGCCG CGACCCGACC CTGTCGGCCA CCCTGAAGTT CGAAAATGGC
CGCACCCGGC TGGGACTGTT CGACACCGGG CCTTCGCCGC GGGTTTATTG A
 
Protein sequence
MTHNAAALSR KPRRRGLLAP FVVLALVALG WSAGWVWLRG QAEQRMDATA LSLKSRGYDL 
SWDVRTFSGY PFRMDVRLTN ARVAEPSGWA LRAPELTGEA MAYDIGHWVV VAPAGVVMTR
PINGDVAITG QALRASFAGF DKYPPRISVE GANLIFTTAP GVAPFPLLST AGLQLHIRPG
PDDQGAIFFE AKGAKARFTG LMGRIAEDRT ADLIWDSKIS KVSALRGRNW ADAVGDWSKA
GGTLTVQQGK LNAGEALLEA KSGALTVGDD GRLQGALDVT VREVPSPGEA LKSPDAAAAA
VAQALGRDPT LSATLKFENG RTRLGLFDTG PSPRVY