Gene Caul_4934 details


Gene Information

Locus tag: Caul_4934
Symbol:
ID: 5902396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5331554
End bp: 5332672
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 66%
IMG OID: 641565454
Product: FAD dependent oxidoreductase
Protein accession: YP_001686552
Protein GI: 167648889
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID:
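
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5331554
end_bp = 5332672
gene_length_bp = 1119
protein_length_aa = 372


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span is 5332672 - 5331554 + 1 = 1119 bp, and 1119 / 3 - 1 = 372 residues, matching the listed Gene Length and Protein Length.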


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.2077
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.974431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAT TGCTCAACGC CGATGTTGTG GTCGTCGGCG CCGGTGTCGT GGGTTTGGCC 
TGCGCCGCGG CCTTGGCCAA GCATCATTAC GTGCTGGTTC TCGAAGCCGA GACAGCGATC
GGCACCCAGA CGTCGTCCCG CAACAGCGAA GTGATCCACG CCGGCATCTA CCATCCCACC
GGCAGCCTCA AGCATGAGCT TTGTGTGCGC GGTCGACGTT TGCTGTATCC CTACCTGGAA
GCTCGGCAGG TCTCCTATCG GCGTAGCGGC AAGCTGATCG TGGCGACCAG CGCTGAGGAG
GATTCCAAGG TCGAGGCGAT CCATCGTCAG GCGTTGGGCA ACGGCGTCGA AGGCATGCGA
TTGCTCAGCG GGGCCGAGGC CCGTGCTCTG GAGCCCAATC TGCGTTGCAC CTTGGCCACG
CGATCTTCGG AAACCGGCAT CGTCGATAGC CATGGCTTGA TGCTGGCCCT GCAAGGCGAG
ATCGAGGATG CCGGAGGAGC GATCGCCTTC GGCGCGCCCG TGCTGAGCGG CGAAATTCTC
GACGGCGGCG GCTTTGAGCT CGATGTCGGC GGCGAGCACC CAGTGCGCTT GCGCTGCGCC
ACCCTGGTCA ACGCCGGCGG CCTGAAGGCT CAGGCCCTCG CCGCCGCGAT GAGGCGCCGT
CCCAACGCCG TGCCTCCTCT GAGCTTGGCC AAGGGATCCT ATTTTAGCTA CGGGGGCGCG
CCGGCCTTTT CGCAATTGAT CTATCCCGCC CCCGTGGACG GCGGTCTGGG CGTCCACGTG
ACCTTGGACT TGGCGGGGCG GATGCGTTTT GGCCCCGACG TCGAGTGGCT GGATCACGAT
GATCCGGACT CCGTCGACTA CGCTGTTGAC CCGCGTCGGG CGGACGCCTT CTATGCCGCC
GTGCGTCGCT ATTGGCCCGG CCTGCCGGAC GGCGCCCTGG TCCCCGACTA CGCCGGTTGC
CGTCCAAAGC TCAGCGGTCC CGGCGCCGCC GCCGATTTTC GGATAGACGG GCCGCGGACG
CACGGCCAAG AGGGGCTCGT GGAGCTGTTC GGCGTTGAAT CGCCGGGGCT CACCAGCGCG
TTGGCGATCG CGGAATACGT GGTTTGCGCG CTGTCTTAG
 
Protein sequence
MTELLNADVV VVGAGVVGLA CAAALAKHHY VLVLEAETAI GTQTSSRNSE VIHAGIYHPT 
GSLKHELCVR GRRLLYPYLE ARQVSYRRSG KLIVATSAEE DSKVEAIHRQ ALGNGVEGMR
LLSGAEARAL EPNLRCTLAT RSSETGIVDS HGLMLALQGE IEDAGGAIAF GAPVLSGEIL
DGGGFELDVG GEHPVRLRCA TLVNAGGLKA QALAAAMRRR PNAVPPLSLA KGSYFSYGGA
PAFSQLIYPA PVDGGLGVHV TLDLAGRMRF GPDVEWLDHD DPDSVDYAVD PRRADAFYAA
VRRYWPGLPD GALVPDYAGC RPKLSGPGAA ADFRIDGPRT HGQEGLVELF GVESPGLTSA
LAIAEYVVCA LS
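
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: `translate` and `gc_fraction` are hypothetical helper names, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGACGGAAT TGCTCAACGC CGATGTTGTG GTCGTCGGCG CCGGTGTCGT GGGTTTGGCC"
print(translate(first_line))  # MTELLNADVVVVGAGVVGLA
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.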