Gene Caul_4478 details

Gene Information

Locus tag: Caul_4478
Symbol:
ID: 5901939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4851350
End bp: 4852747
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 71%
IMG OID: 641564997
Product: FAD linked oxidase domain-containing protein
Protein accession: YP_001686096
Protein GI: 167648433
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
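
The listed gene and protein lengths follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4851350, 4852747)
print(length, protein_length_aa(length))  # 1398 465
```

Both values agree with the Gene Length and Protein Length fields above.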


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.408304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.958001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCCG ATGTCGTCTC CCGCCTGAAA GCCGTGCTCG GCGAGGGCGG ATGGAGCCAG 
GATCCCGACC GCCTGGCGCC CAAGCTGCGG GAGTGGCGCG GACGGTGGAG CGGGCAGACG
CCGCTGCTGG CCCTGCCCCG TTCGACGGCC CAGGTCGCGG CGGTGGTCGG CGTCTGCGCC
GCCGAGGGGG TGGCGATCAT CCCGCAGGGT GGCAATACCG GCCTGGTCGC CGGCCAGATC
CCGCAGGGCG AGATCCTGCT GTCGACCGAA AAACTGACGA CGGTGCGCGA CGTCGATGCG
TTTGACGACG TCATGGTTCT GGAGGCCGGC GTGACCCTGG CCAAGGCCCA CGAGGTCGCC
TTGTCGGTCA ATCGCCGTTT CCCGCTCAGC CTGGCCTCGG AGGGATCCTG CACGATCGGC
GGCCTGGCCT CGACCAACGC CGGCGGCACG GCGGTGCTGC GCTATGGCGT CATGCGAGAC
CAGATCCTGG GGATCGAGGC GGTGCTGCCC AATGGCGAGG TCTGGAACGG CCTCAAGCGG
CTGCGCAAGG ACAACACGGG CTATGACTTG AAGCATCTGC TGATCGGCGC CGAGGGCACG
CTGGGGATCA TCACCGCCGC CAGCCTCATG CTCTATCCCC TGCTGGCTTC GCGGAGCGTG
GCGATCGCCG CCGTGACCAC GCCGCATGAC GCCATCGCCC TGCTGGCCCG CGCCAAGGAC
GAGACCGGCG GAGCGGTCGA GGCCTTCGAG CTGATGAGCC GTCTGGGCGT TGCCTTCGCG
CTGAAGAACA TCCCCGGACT GCGCGAACCG CTGGAGGCCG TGCATCCCTG GTACGTGCTG
ATCGAGACCG CCTCGGGCGA GCCCGGCGCG GCCGAGGCGG CCATGGAGCG GCTGCTGGCC
GGGGCGCTGG AGCGCGGCCT GATCCAGGAC GCCGCCGTCG CCCAGTCCGA AGCCCAGGCC
CAGGCCTTCT GGGCGGTGCG CGAGAACCAG TCCGGCGGCC AGAAGCCCGA GGGCGCGGCC
TGGAAGCACG ACGTCTCGGT CCCGGTCTCC AAGGTCGCCG ACTTCATCGA CCAGGCCACG
GCGGCGGTGG AAAAGCTGTC GTCCGGCGTT CGCGTCGTGG CCTTCGGCCA TGTCGGCGAC
GGCAATGTGC ATTACGATGT CCTGCGGGCC GACGGGGCGG CCGACGACCC GCACGACGCC
CTGCGCGACG CGGGCGCGCG GATCGTCCAC GACATCGTGG CCAGCATGAA CGGCTCGATC
AGCGCCGAGC ACGGCCTGGG GGCGATGAAG TCGGTCGAGG CTCTGCGCTA CAAGAGCGCC
GTCGAGGTCG AGGCCCTGCG CGCTGTCCGC GCGGCGCTCG ACCCTCAGCG GATCATGAAC
CCTCGGGTGC TGTTCTAG
 
Protein sequence
MASDVVSRLK AVLGEGGWSQ DPDRLAPKLR EWRGRWSGQT PLLALPRSTA QVAAVVGVCA 
AEGVAIIPQG GNTGLVAGQI PQGEILLSTE KLTTVRDVDA FDDVMVLEAG VTLAKAHEVA
LSVNRRFPLS LASEGSCTIG GLASTNAGGT AVLRYGVMRD QILGIEAVLP NGEVWNGLKR
LRKDNTGYDL KHLLIGAEGT LGIITAASLM LYPLLASRSV AIAAVTTPHD AIALLARAKD
ETGGAVEAFE LMSRLGVAFA LKNIPGLREP LEAVHPWYVL IETASGEPGA AEAAMERLLA
GALERGLIQD AAVAQSEAQA QAFWAVRENQ SGGQKPEGAA WKHDVSVPVS KVADFIDQAT
AAVEKLSSGV RVVAFGHVGD GNVHYDVLRA DGAADDPHDA LRDAGARIVH DIVASMNGSI
SAEHGLGAMK SVEALRYKSA VEVEALRAVR AALDPQRIMN PRVLF
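
The gene and protein listings above are related through translation table 11. The sketch below shows one way to check that relationship: it strips the 10-base blocking, translates codon by codon, and stops at the first stop codon. It rests on two assumptions: the codon-to-residue assignments of table 11 match the standard code, and the special handling of alternative start codons can be ignored because this gene begins with ATG. All function names are hypothetical.

```python
# Codon table built in the conventional TCAG order; the residue string is the
# standard-code assignment, which table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Join a blocked listing into one uppercase string with no whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as listed above.
first_row = clean_sequence(
    "ATGGCGTCCG ATGTCGTCTC CCGCCTGAAA GCCGTGCTCG GCGAGGGCGG ATGGAGCCAG"
)
print(translate(first_row))  # MASDVVSRLKAVLGEGGWSQ
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way gives a value to compare against the GC content field.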