Gene Caul_4571 details


Gene Information

Locus tag: Caul_4571
Symbol:
ID: 5902032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4946560
End bp: 4947855
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 70%
IMG OID: 641565090
Product: hypothetical protein
Protein accession: YP_001686189
Protein GI: 167648526
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
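
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the last codon is a stop codon that is not counted in the protein length. Both readings are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4946560, 4947855)   # 1296
length_aa = protein_length(length_bp)       # 431
```

With the record's values, 4947855 - 4946560 + 1 gives 1296 bp, and 1296 / 3 - 1 gives 431 aa, matching the Gene Length and Protein Length fields.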


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.775043
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTTT CCGATCCGTT CCTGATCCTC GCCATCGTCT TCGCCCTGCT GGCCGCCGGC 
GCCGTGCTGT GGGCCCTGGC CAGCCAGCGG CGCGCGACCG GCGCCGACGC CCGCGCGTGG
GAGCTGAACG CCAGGCTGGT CCAGGCCGAC GAGCGGGCGC GGCTGCTGGA AGACCAGGCC
GTCACCCAGG GCGAGCTGAT CCGCGCCCAG GCCGCCCAGC AGGCGACCAT GACCGCCAAC
ACCGTCGCCG AGGCGCTGAT CAAACGCACC GAGGAGAACT TCAAGAGCCG CGAGGCGCTG
TCCCAGGCCC GGCTCGAAGC TCAGTTGAAG CCGGTGGCCG AGACCCTGGC CAAGTTCGAG
GCCCAGGTCA CCGCCGTCGA AAAGGCCCGC GCCGAGGAGA CCGGCGGCCT GAAGGCCCAG
ATCAACGCCC TGATGGAGGC CTCGGTCGCC ACCCAGTTCG AGGCCCGCAA GCTGTCGGCC
GCCCTGCGGC GCGGGGCCGG GGTCCAGGGC CGCTGGGGGG AGCAGACTTT ACGTAACGTT
CTCGAGGCCG CCGGCCTCAA CAACCGCTTC GACTTCGAGG AGCAGTTCAG CGTCGAGAGC
GACGAGGGCC GTCGTCGTCC CGACGTCAAG GTCAAGATGC CGGGCGGCGG GGTGTTCGTG
ATCGACGCCA AGTGCTCGCT GAACGCCTTC CTCGAGGCCC AGGAAGTGAC CGAGGAGCAC
CTGCGCGAGG CGGCCATGAT CCGTCACGCC GCCAGCGTCC GCGCCCACAT GCAGGGTCTT
TCCGCGAAGG CCTATTGGGA CCAGTTCGCC GGCGAGGGCT CGCCCGACTT CGTGGCCATG
TTCGTGCCCG GCGACGGATT CCTGGCCGCC GCCCTGGACC GCCTGCCCGA CCTGATGACC
GAGGCCATGG ACCGCCGGGT GCTGCTGGTC ACCCCGACCA CCCTGTTCGC TCTCTGCAAG
GCCGTCGCCT ATGGCTGGCG GGCCGAGGAC CAGGCCAAGA ACGCCGCCGC CATCGTCGCG
GTGGGCCGCG AGCTCTATAA GCGCATCGCC GTGATGGGGG CCCATGCCGG CTCGGTGGGC
AAGGCGCTGG AGGCTGCCGT CGGCCGCTAC AACCAGTTCG TCGGCTCGCT GGAAAGCCAG
GTCCTGACCC AGGCTCGCCG CTTCGAGGAC CTGTCGGTGG ATCACGAGGG CAAGGAGATC
GGCGAGCTGG CCCCGGTCGA GAACGCCGTG CGGCCGCTGG TCAAGCTGGC CGAGGCGCCG
GCGGAGCCCG TGGCTCGCCT GCAGGCCAAG CCTTAG
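
A GC content figure like the one reported in the Gene Information section can be recomputed from a sequence block in this layout. The sketch below is one possible approach, not the database's own method: it assumes the sequence is displayed in space-separated groups across several lines, and the function name `gc_content` is illustrative.

```python
def gc_content(sequence_block: str) -> float:
    """Fraction of G and C bases in a sequence shown in whitespace-separated groups."""
    # Drop the spaces and line breaks used only for display.
    seq = "".join(sequence_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustrative input (not the gene above): 6 of 8 bases are G or C.
example = gc_content("ATGC GGCC")   # 0.75
```

Passing the full gene sequence as a single string gives a fraction that can be multiplied by 100 and rounded for comparison with the reported percentage.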
 
Protein sequence
MNFSDPFLIL AIVFALLAAG AVLWALASQR RATGADARAW ELNARLVQAD ERARLLEDQA 
VTQGELIRAQ AAQQATMTAN TVAEALIKRT EENFKSREAL SQARLEAQLK PVAETLAKFE
AQVTAVEKAR AEETGGLKAQ INALMEASVA TQFEARKLSA ALRRGAGVQG RWGEQTLRNV
LEAAGLNNRF DFEEQFSVES DEGRRRPDVK VKMPGGGVFV IDAKCSLNAF LEAQEVTEEH
LREAAMIRHA ASVRAHMQGL SAKAYWDQFA GEGSPDFVAM FVPGDGFLAA ALDRLPDLMT
EAMDRRVLLV TPTTLFALCK AVAYGWRAED QAKNAAAIVA VGRELYKRIA VMGAHAGSVG
KALEAAVGRY NQFVGSLESQ VLTQARRFED LSVDHEGKEI GELAPVENAV RPLVKLAEAP
AEPVARLQAK P
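
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The Python sketch below is a simplified illustration under stated assumptions: the sequence is read in frame from its first base, translation stops at the first stop codon, and the special handling of alternative start codons in table 11 is omitted, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used only for display.
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
first_ten = translate("ATGAACTTTT CCGATCCGTT CCTGATCCTC")   # "MNFSDPFLIL"
```

The first ten codons translate to MNFSDPFLIL, which matches the start of the protein sequence, and the final codons GCC AAG CCT TAG give AKP followed by a stop, which matches its end.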