Gene Caul_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2604 
Symbol 
ID5900059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2824704 
End bp2825711 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content71% 
IMG OID641563095 
ProductNifR3 family TIM-barrel protein 
Protein accessionYP_001684229 
Protein GI167646566 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.131824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATA CTCTCTCAGT CGGTAAGGTC GAGGTGCCCG GACGCGTTTG GATCGCGCCG 
ATGACGGGCG TCTCCGATCT ACCCTTCAGG GAAACCGCCA CCCGCCTCGG CGCGGCCTAT
GTGGCGACCG AGATGGTGGC CTGCGCCGAA TTTGCGCGTG GACGGCCCGA CGCCGTGCGC
CGCGCCGCCG TGGGCGACGG CCTGCCCCTG ATGGTCGTCC AACTGGTCGG CCGTGATCCC
ACCTTCATGG GCCAGGGCGC GCGGATGGCC GCCGAGGCCG GGGCCCAGAT CATCGACCTG
AACTTCGGTT GCCCTTCCAA GCAGGTCACC GGGGGCGTGG CCTCCGGCTC GGCCCTGATG
CGCGAGCCGG ACCTGGCGGA AGCTCTGGTC GCCGCCGCCG TCCGGGCCGT CGACGTGCCG
GTCACCGTCA AGATGCGCCT GGGCTGGGAC GACGACAGCC GTAACGCCGC CGACATCGCC
CGCCGGGCCG TCGACGCCGG GGCGCAGGCG ATCACCGTCC ACGGCCGCAC CCGCTGCCAG
TTCTACAAGG GCGTGGCCGA CTGGAGCGCC GTGGCGGCCG TCAAGGCGGC GGTGTCGGTT
CCGGTGCTGG TCAATGGCGA CATCATCGAC GGCGACACCG CTCGCCTGGC CCTGGAGCAG
TCCGGCGCCG ACGGGGTGAT GATCGGCCGC GGCGTCTATG GCCGCCCGTG GATCGCCCAA
GCCATTGAGG CGGCCCTGAA CGGCGAGGGC TTCCGCGAAC CGGACGCCGA GGAGCGCCTG
GCCATCGCCG TCACCCATTT CCGCCGCAGT CTGGGCTTCT ACGGCCAGAA CCTCGGCCTC
AAGATGTTCC GCAAGCACCT GGCCTCCTAC ATCGAGGCCG CGCCCTGGCC CGATAGCGAG
GAACTTCGCC GCACCGCGCG CGCCGCCCTG TGCCGCCTGG AGGATCCCGC CGCGATCGAG
GACGGCCTGG CCGCTCTGTG GCTGGGCGAC CGGAGGCTGG CCGCATGA
 
Protein sequence
MSNTLSVGKV EVPGRVWIAP MTGVSDLPFR ETATRLGAAY VATEMVACAE FARGRPDAVR 
RAAVGDGLPL MVVQLVGRDP TFMGQGARMA AEAGAQIIDL NFGCPSKQVT GGVASGSALM
REPDLAEALV AAAVRAVDVP VTVKMRLGWD DDSRNAADIA RRAVDAGAQA ITVHGRTRCQ
FYKGVADWSA VAAVKAAVSV PVLVNGDIID GDTARLALEQ SGADGVMIGR GVYGRPWIAQ
AIEAALNGEG FREPDAEERL AIAVTHFRRS LGFYGQNLGL KMFRKHLASY IEAAPWPDSE
ELRRTARAAL CRLEDPAAIE DGLAALWLGD RRLAA