Gene Caul_1508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1508 
Symbol 
ID5898963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1603608 
End bp1604708 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content75% 
IMG OID641561995 
ProductDNA protecting protein DprA 
Protein accessionYP_001683136 
Protein GI167645473 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.300262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGGG GCCTGTCCGA CAAAGAGCGC CTGGCCTGGC TGAGGCTGGC GCGCACCGAG 
ACGGTCGGCC CCGTCGCCTT CGACCACCTG TTGACGCGGT TTGGTTCGGC CGAACGCGCC
CTGATCGCCC TGCCCGAACT GGCCCGGCGT GCCGGCCGCG CCTCGCCCCT GCGCCCGCCG
CCCGAGGGCG AGATCCTGGG CGAACTCGAG ATCGGCGACC GCCTCGGCGC GCGGCTGATC
TGCGCCTGCG AACCCGACTA TCCGCCCCGG CTGGCCGCCC TCGACCCGCC GCCGCCCGTG
CTGTGGGCCC TGGGCCATGC CGATCTGCTG TCCCAGCCCA GCCTGGCCAT CGTCGGGGCG
CGGATCGCCT CGGCCGGCGG CCAGCGCTTC GCCCGCCAAC TGGCCACCGA GCTGGGTCGG
CACGGCTATG TGGTGGTCTC GGGGTTAGCG CGCGGCGTCG ACGGCGCGGC CCACGAGGGC
GCCCTGGCGA CCGGCACCGT CGCCGTGCTG GGCGGCGGGG TGTGCGACAT CTATCCGCCC
GAGCACGCCG CCCTGCACGC CCGCATCGCC GGCGAGGGCG GCTGCATCGT CAGCGAAAGC
GCCCCCGACC GCCGCGCCAT CGCCAAGGAC TTCCCGCGCC GCAACCGCAT CATCTCGGGC
TTGTCGCTGG GCGTGGTGGT GGTCGAGGCC GAGCTGAAGT CCGGCTCGCT GATCACCGCC
CGCCTGGCCG CCGAGCAGGG TCGCGACGTG TTCGCCGTGC CCGGCTCGCC GCTCGACCCG
CGCGCCAGGG GTCCCAACGA CCTGATCCGC CAGGGCGCGA TCCTCTGCGA GGGCGTCGAG
GACGTGCTGC GCTCGCTGTC GGGCCAGGCC CACCTGCGCG AGCGCGAACG CCCCTACGCG
GCCGAGGACG ACGACGCCGA GATCGACCAC GAGGCCCTGC GCGAGGAAGT CGCCCGCCTG
CTATCGCCCA CCCCGGTCTC GCGCGACGAC CTGGTCCGCG CCACCCGCGC CCCGACCTCG
GCGGTGATGG CGGCCTTGGT GGAACTGGCC CTGGCCGAGC GGGTGGAACT GCTGCCGGGC
GGGATGGTGG CGGGGGTTTG A
 
Protein sequence
MTRGLSDKER LAWLRLARTE TVGPVAFDHL LTRFGSAERA LIALPELARR AGRASPLRPP 
PEGEILGELE IGDRLGARLI CACEPDYPPR LAALDPPPPV LWALGHADLL SQPSLAIVGA
RIASAGGQRF ARQLATELGR HGYVVVSGLA RGVDGAAHEG ALATGTVAVL GGGVCDIYPP
EHAALHARIA GEGGCIVSES APDRRAIAKD FPRRNRIISG LSLGVVVVEA ELKSGSLITA
RLAAEQGRDV FAVPGSPLDP RARGPNDLIR QGAILCEGVE DVLRSLSGQA HLRERERPYA
AEDDDAEIDH EALREEVARL LSPTPVSRDD LVRATRAPTS AVMAALVELA LAERVELLPG
GMVAGV