Gene Caul_1879 details


Gene Information

Locus tag: Caul_1879
Symbol:
ID: 5899334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2016543
End bp: 2017751
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 67%
IMG OID: 641562369
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_001683506
Protein GI: 167645843
COG category: [C] Energy production and conversion
COG ID: [COG1251] NAD(P)H-nitrite reductase
TIGRFAM ID:
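
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_1879.
length_bp = gene_length(2016543, 2017751)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 1209 bp; 402 aa
```

Both results agree with the Gene Length and Protein Length fields in the record.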


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.520155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGG TCATCATCGG AGCAGGGCAT GCGGGCGGCA CGGTCGCCGC TCTCCTGCGC 
CAGCTCGGCC ATGACAAGCC GATCGTTCTG GTCGGCGAAG AGCCGCATCC TCCCTATCAG
CGTCCGCCGT TGTCGAAGGG TTGGCTGAAG GGGGAACTGG GCGAGGACGG CCTGCTGCTG
CGCCCGCGCG CTTGGTACGC CGAAAACAAC GTGGACCTAC GCACCTCGAG CCGCGTGGTC
GGCATCGATC GTCAGACGCG ACGGCTCACG CTGTCGACCG ACGAGACCCT CGACTATGAC
ACCTTGATCC TGGCGACCGG CGCCCGCGCG CGAAAACTGG TGCTGCCGGG CGGCGATCTG
AAGGGCTTCC TCGAGCTTCG CACCATTGAG GATGCGGAAG TCATAAAGGC CTGGTTCAGG
CCCGGGTTTC GCCTGGCGAT CATCGGGGGC GGTTACGTCG GCCTGGAAGT GGCGGCGTCA
GCGCGCAAGC TGGGCGCCGA GGTGGACGTT TTGGAGCGCG AGGATCGGCT GCTGGCCCGG
GTCGCCGGTC CGGTGCTGTC GTCCTTCTTC CGTGACGTCC ACGAGGAGAA CGGAGTCCGC
TTCCATTTTG GCGTAGCGGT GGAAGGGTTC GAGGGCCTGG ACGGGCAGGT GTCGGGGGTG
CGGCTGGCGG GACGGCCGAC GCTGCATTGC GACGCGGTTC TGGTCGGCGT CGGCGCCATC
CCCAATGACG ATCTGGCGAA GGCCGCGGGC CTGGCCTGCG ATGACGGGGT GATCGTCGAC
GCTCAGGCGC GGACCTCGGA TCCTCACATC TTCGCCATTG GCGACGTCAC GCGACGGCCG
ATGGCGCTCT ATGGCAGGAC CATGCGTCTG GAAAGCGTCC CCAACGCCCT TGAGCAGGCC
CGCCAAGCCG CCGCCGCCAT TGCTGGCGCG CCCGATCCCA AGCCCGAGAC GCCCTGGTTC
TGGTCCGACC AATACGACAT CAAGCTCCAG ATCGGCGGGC TGCCGTTCGA TGTTGATCAG
GTCGTGTTGC GCGGCGATCC AGCGGCCCGG AAGTTTGCGC TGTTTCATCT TTCCGAAGGT
CGAGTTCAAG CCGTGGAGGC GGTCAACAGT CCGCCGGAAT TCATGGTGGG GCGTCAGTGG
CTGGCTTCCC GCCGCGACGT CGACCCCGTC CGCCTGGCGG ACGCTTCGAT CCCGATCAAG
GAGGTCTGA
 
Protein sequence
MSTVIIGAGH AGGTVAALLR QLGHDKPIVL VGEEPHPPYQ RPPLSKGWLK GELGEDGLLL 
RPRAWYAENN VDLRTSSRVV GIDRQTRRLT LSTDETLDYD TLILATGARA RKLVLPGGDL
KGFLELRTIE DAEVIKAWFR PGFRLAIIGG GYVGLEVAAS ARKLGAEVDV LEREDRLLAR
VAGPVLSSFF RDVHEENGVR FHFGVAVEGF EGLDGQVSGV RLAGRPTLHC DAVLVGVGAI
PNDDLAKAAG LACDDGVIVD AQARTSDPHI FAIGDVTRRP MALYGRTMRL ESVPNALEQA
RQAAAAIAGA PDPKPETPWF WSDQYDIKLQ IGGLPFDVDQ VVLRGDPAAR KFALFHLSEG
RVQAVEAVNS PPEFMVGRQW LASRRDVDPV RLADASIPIK EV
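
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequence is pasted in the 10-character blocks shown above (whitespace is stripped before processing) and builds the codon table by hand rather than through a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code; it differs only in permitting alternative start codons, which this sketch does not handle. That is sufficient here because the gene begins with ATG. The names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAGCACGG TCATCATCGG AGCAGGGCAT"))  # MSTVIIGAGH
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the protein sequence and the GC content field in the record.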