Gene Caul_4431 details

Gene Information

Locus tag: Caul_4431
Symbol:
ID: 5901892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4799978
End bp: 4801207
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 73%
IMG OID: 641564949
Product: hypothetical protein
Protein accession: YP_001686049
Protein GI: 167648386
COG category: [R] General function prediction only
COG ID: [COG2081] Predicted flavoproteins
TIGRFAM ID: [TIGR00275] flavoprotein, HI0933 family
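
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends with a single stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4799978
end_bp = 4801207

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1230 0 409
```

Under these assumptions the result agrees with the listed gene length (1230 bp) and protein length (409 aa).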


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.803445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.770543
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAAC TCCAGACTCC CGACGTCGCC GTGATCGGCG GTGGGCCGGC CGGGCTGATG 
GCGGCCGAGA TGCTGAGCGC GGCGGGGCTG TCGGTGGCGG TGTTCGAGCG CATGCCGACC
CTGGGGCGCA AGTTCCTGAT GGCCGGGCGC GGCGGGCTGA ACCTGACCCA TTCGGAAGAC
CTTGAGCGGT TCGTGGCGCG CTACGGCGGC GCGAGCGAGC GGCTGCGGCC GATGCTGCAG
GCCTTCACGC CAGCCGATCT CGTCGCCTGG GCCGAAGGGT TGGAGCAGGA AACCTTCGTC
GGCACCAGCG GTCGGGTGTT TCCCAAGGCG CTGAAGGCCT CGCCGCTGCT GCGGGCCTGG
ATCGCGCGGC TGGAGGGGCG TGGCGTGGCG CTCAACACCC GCTCGACCTG GACGGGCTGG
AACGCGGCCG GCGACCTGGT CTTCGACACG GCGGACGGCG TTCGGACCGT GCGGCCGCGC
GCCACCATCC TGGCCGTCGG CGGGGCCAGT TGGGCCAAGC TGGGGTCGGA CGGCGCCTGG
GCGCCGCTGC TGGCCGCGCG CGGGGCGTCG CTCGCGCCGT TCAGGCCGGC CAATGTCGGC
TTCGCAGTCA CTTGGACGAA GGTGTTCCGC GAACGCTTCG CCGGCGCGCC GCTGAAGAAT
ATCGGCCTGA GCTTCGAGGG TCAGGCCTCG CGGGGCGACG CCCTGGTGGC GGCCTACGGC
CTGGAGGGCG GGGCGGTGTA CGCCCTGTCG GCGGCTCTGC GCGACGCGAT CCTGGCGCGA
GGCTCGGCGA CCCTGGACAT TGACCTGCGT CCCGACGTCC CCCTGGCCCA ACTGACCGCG
CGCCTGTCCA GGCCGCGCGG CGGGCAGTCG CTGTCGAGCT GGCTGCGCAA GGCCGCCCAC
CTGTCGCCGG TCGAGATCGG CCTGCTGCGT GAAGCCCACG GCATGGCCCT GCCGGTCGCG
CCCGACGCCC TGGCGGCGGC GATCAAGGCC GCGCCGATCG TGCTGACCGG AACGCAGGGG
CTGGAGCGGG CCATCTCCTC GGCCGGCGGC CTAAGCTTCG AGACCCTCGA CGGCCTGGCG
TTGAAAGGCG CGCGAGGGGT GTTCGCGGCG GGCGAGATGC TGGACTGGGA GGCCCCGACT
GGCGGCTACC TGCTGCAGGC CTGTTTCGCG ACCGGGGTGG CGGCGGCGCG CGCGGTGGTG
GAGCATCTTC AGGCCTGCGG TCGAGCGTGA
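
The GC content reported for this gene is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, not part of any database tooling, and it ignores the spaces and line breaks of the blocked 10-base layout used here. Only the first sequence line is used as sample input, so the printed value describes that line rather than the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence in this record, kept in its blocked layout.
first_line = "GTGACCGAAC TCCAGACTCC CGACGTCGCC GTGATCGGCG GTGGGCCGGC CGGGCTGATG"
print(f"{gc_percent(first_line):.1f}%")
```

Passing the full gene sequence to the same function gives a figure that can be compared with the GC content field of the record.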
 
Protein sequence
MTELQTPDVA VIGGGPAGLM AAEMLSAAGL SVAVFERMPT LGRKFLMAGR GGLNLTHSED 
LERFVARYGG ASERLRPMLQ AFTPADLVAW AEGLEQETFV GTSGRVFPKA LKASPLLRAW
IARLEGRGVA LNTRSTWTGW NAAGDLVFDT ADGVRTVRPR ATILAVGGAS WAKLGSDGAW
APLLAARGAS LAPFRPANVG FAVTWTKVFR ERFAGAPLKN IGLSFEGQAS RGDALVAAYG
LEGGAVYALS AALRDAILAR GSATLDIDLR PDVPLAQLTA RLSRPRGGQS LSSWLRKAAH
LSPVEIGLLR EAHGMALPVA PDALAAAIKA APIVLTGTQG LERAISSAGG LSFETLDGLA
LKGARGVFAA GEMLDWEAPT GGYLLQACFA TGVAAARAVV EHLQACGRA
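
The protein sequence corresponds to the gene sequence translated with translation table 11, under which an alternative initiation codon such as the GTG at the start of this gene is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. `translate_cds` is a hypothetical helper, the codon table is written out in the standard NCBI base order (TCAG), and the start-codon set is the one listed for NCBI table 11. Codons containing characters other than A, C, G, or T would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons for table 11; at position 0 they are translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "GTGACCGAAC TCCAGACTCC CGACGTCGCC GTGATCGGCG GTGGGCCGGC CGGGCTGATG"
print(translate_cds(first_line))  # MTELQTPDVAVIGGGPAGLM
```

The output matches the first 20 residues of the protein sequence above. Note that the internal GTG codon is translated as valine, since only the first codon is treated as an initiation codon.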