Gene Caul_3934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3934 
Symbol 
ID5901396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4257557 
End bp4259206 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID641564455 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001685557 
Protein GI167647894 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.217875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.113832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCTA CACAGCAGCG CGACGCGATC ATCATCGGGG GCGGCCACAA CGGCCTAGTC 
TGCGCCTTCT ACCTGGCCAG CGCCGGATTG AAGGTCACGG TCTGCGAAGC CCGCGACGTG
GTCGGCGGCG CCGCCGTCAC CGAGGAGTTC CACCCGGGCT TCCGCAACTC GGTCGCCAGC
TACACGGTCA GCCTGCTCAG CCCCAAGGTC ATCGCCGACA TGGACCTGCA CGGCCACGGC
CTGCGGATCC TGGAGCGGCC GATCTCCAAC TTCCTGCCGA TCGACAACCA CAGCTACATG
AAGCTGGGCG GCGGGCTGGA GCGCACCCAG GCCGAGTTCT GCAAGTTCTC GACCAGGGAC
GCCGAGCGCT TGCCGGCCTA CTACGCCATG CTCGACGAGA TTGGCGACGT ACTGCGCGAC
CTGGCCGGCG AGACCCCGCC CAATCTCGGC GACGGCCTGC CGGGCCTGCT GCGGGCCCTG
CGCCAGGGCG GACGGATGGC CGGCCTTTCC CGCGAGCGCA AGCGCGACCT GCTGGACCTG
TTCACCAAGA GCGCCCGCGA CTTTCTCGAT GGCTGGTTCG AGAGCGACGC GGTCAAGGCC
AGCTTCGGCT TCGACGCCGT GGTCGGCAAC TTCGCCAGCC CCGACAGCCC CGGTTCGGCC
TACGTCCTGC TGCACCACAC CTTCGGCGAG GTGAACGGCA AGAAGGGCGC CTGGGGCCAT
GCGGTCGGCG GCATGGGCTC GATCACCCAG GCCATGGCCA AGGCCTGCCG CGCCAAGGGC
GTGGAGATCC TGCTCAAGGC ACCGGTCGAG GCGATCCATG TGGAGGATAC CGCGAAAGGC
GGCGGCAAGC GCGCCGTGGG CGTGCAACTG GTCGACGGCC GCCAGATCAT GGCCCCGATC
GTCAGCTCCA ACCTCAACCC CGCCCTGCTG TACGGCAAGC TGGTCGCCCC CTCGGCCCTG
CCCGCCCCAT TCCGCAAGGC GATCAAGGGC TACAAGAACG GCTCGGGCAC CTTCCGGATG
AACGTGGCCC TGTCGGAGCT GCCGGACTTC ACCTGCCTGC CCGGCAAGGC CGCGGCCGAG
CACCACCAGT GCGGGATCGT GCTGTCGCCG ACCCTGGACT ACATGGACGA GGCCTATCGC
GACGCCAAGG CGACCGGTAT TTCCAAGAAA CCCATCGTCG AGATCCTGAT CCCCTCGACC
CTGGACGACA GCCTAGCCCC GCCTGGCCAG CACGTGGCCA GCCTGTTCTG CCAGCAGTTC
GCCTGGGACC TGCCCGACGG CCGCTCCTGG GACGACGAGC GCGAGGCCGC CGCCGACCTG
ATCATCGACA CGGTCAACCA GTGGGCGCCC AACTTCAAGG CGTCGGTCCT GGGGCGGATG
ATCCTGTCGC CCGTGGATCT GGAGCGGAAG TTCGGCCTGG TCAACGGCGA CATCATGCAC
GGTCACATGT CGCTGGACCA GCTGTGGGCG GCCCGCCCGG TGCTGGGCCA CGCCAGCCAC
CGGGCGCCGA TCAAGGGGCT CTACATGTGC GGCGCGGGCT GCCATCCGGG CGGCGGGGTC
TCGGGCAATC CGGGGCGGAA CGCGGCCCGC GAGATCCTGA GGGATCGGGA TTTCGCCACG
GCGGTGAAGC TGAGCGTGGT GGGGCGGTGA
 
Protein sequence
MAATQQRDAI IIGGGHNGLV CAFYLASAGL KVTVCEARDV VGGAAVTEEF HPGFRNSVAS 
YTVSLLSPKV IADMDLHGHG LRILERPISN FLPIDNHSYM KLGGGLERTQ AEFCKFSTRD
AERLPAYYAM LDEIGDVLRD LAGETPPNLG DGLPGLLRAL RQGGRMAGLS RERKRDLLDL
FTKSARDFLD GWFESDAVKA SFGFDAVVGN FASPDSPGSA YVLLHHTFGE VNGKKGAWGH
AVGGMGSITQ AMAKACRAKG VEILLKAPVE AIHVEDTAKG GGKRAVGVQL VDGRQIMAPI
VSSNLNPALL YGKLVAPSAL PAPFRKAIKG YKNGSGTFRM NVALSELPDF TCLPGKAAAE
HHQCGIVLSP TLDYMDEAYR DAKATGISKK PIVEILIPST LDDSLAPPGQ HVASLFCQQF
AWDLPDGRSW DDEREAAADL IIDTVNQWAP NFKASVLGRM ILSPVDLERK FGLVNGDIMH
GHMSLDQLWA ARPVLGHASH RAPIKGLYMC GAGCHPGGGV SGNPGRNAAR EILRDRDFAT
AVKLSVVGR