Gene Caul_0301 details

Gene Information

Locus tag: Caul_0301
Symbol:
ID: 5897575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 337544
End bp: 338989
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 65%
IMG OID: 641560785
Product: radical SAM domain-containing protein
Protein accession: YP_001681936
Protein GI: 167644273
COG category: [R] General function prediction only
COG ID: [COG1964] Predicted Fe-S oxidoreductases
TIGRFAM ID:
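
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 337544
end_bp = 338989

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1446
print(protein_length)  # 481
```

Both results match the reported Gene Length (1446 bp) and Protein Length (481 aa).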


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGGGT CTATTACCGC GCCGATCTCC GGCGAGAGCG CCTCGCGAAA GGTCGCCCCC 
TACCTGTTCC TGGGCCAGAC CACGAGCCTG TGTGAAACCT GTCTGGCCCT GGTTCCGGCC
AAGATCATCG CCGAGGACGA GCGGGTCTAC TACCTCAAGC GCTGTCGCGA ACATGGGGTC
CAGAAGACCT TGATCGCCGA TGATCTCGGC TACTGGAAAG CCCAGAAGGA CTGGCTCAAG
CCCGGCGACC GGCCACTGGC CGTCCAGACG CGCACCGACC ACGGCTGCCC CTACGACTGC
GGCCTGTGTC CGGACCACGA GCAACATTCC TGCTTGGCGA TCCTGGAGAT CAACGAGGCC
TGCAACCTGA CCTGCCCGGT CTGCTTCGCC GGCTCGTCGA CCAGCCTCGA TGCTCACCGG
CCGCTGGCCG AGGTGGAGCG GATGCTGGAC GTGATCGTCG CCTCCGAGGG CGAGCCGGAC
CTCGTGCAGT TGTCGGGCGG CGAGCCCACC CTGCATCCGC AGTTCTTCGA GATCCTGTCG
GCCGCGCGTG CCCGTCCCAT CCGCCACCTG ATGATCAACA CCAATGGCCT GCGGATCGCG
CGCGAGGCCG GCTTCGCCGA GCGGCTGGCG ACCTACATGC CCAGGTTCGA GGTCTATCTG
CAGTTCGACA GCTTGAAGCG CGACGCCCTG ATGCAGCTGC GCGGCGCCGA TCTGGTCAAG
GTCCGCACCC AGGCGTTGGA GGCGCTGGAG CGAAATAATA TACCCACCAC TCTGGTGGTG
ACGCTGAAGA AGGGCGTCAA CGACGACGAG ATCGCCGACA TTGTCCGCTT CGCCCTGACT
TGGCGATGCG TGCGCGGCGT GACCTTTCAA CCGATCCAGG ACGCGGGTCG CAACGACGGC
TTTGAGGCCA AGGACCACCG CATCGTCCTG ACCGAGGTTC GCCGGCGGAT CGCCGAGGCC
GGGGTGTTCG CCCTGGAGGA TCTCATCCCC CTGCCATGCA ATCCCGATCA GATCTGCATC
GGCTATGGCC TGCGTAACGG TCAGAGCGTG GCGCCTGTCA CCTCCCTCCT GCCGCGTGAG
TTGTTCGTGG CCGCCGCGCC AAGCACTGTG ACGTTCGAGG CCTATCCGGA ACTGCAGAAA
AAGGTCTTCG ACCTGCTGTC GCTATCGACG GCCCAGGCTG ACACCTCCGA CAAGCTGGCG
GGCCTGCTGT GCTGCCTGCC CGAAGCGGTG GTCCCTGAAA GCCTGGCCTA TGAGCACACC
TTCCGGGTTG TCATCTCGCA GTTTCTCGAC CGCTACAATT TCGACCTCGG AACCGTGAAG
CGCTCATGCG TGCACTTCGT CGAGCCGGAC GGGCGGATCA TTCCGTTCGA CACCTACAAC
ACCTTCTATC GCCCCGGCGC CGCGGGCGCA GGGGCGCTCG CGCGCGGTCA AGGGAGGGCG
ATATGA
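
A GC content figure such as the one reported for this gene is the percentage of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, not necessarily how the database derives or rounds its value; the function name gc_content is hypothetical, and the input is assumed to be plain A/C/G/T text in the 10-base blocks shown above.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# Small illustrative inputs, not taken from the record.
print(gc_content("ATGC"))       # 50.0
print(gc_content("GGCC GGCC"))  # 100.0
```

To apply it to this gene, pass the full gene sequence text above as a single string.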
 
Protein sequence
MDGSITAPIS GESASRKVAP YLFLGQTTSL CETCLALVPA KIIAEDERVY YLKRCREHGV 
QKTLIADDLG YWKAQKDWLK PGDRPLAVQT RTDHGCPYDC GLCPDHEQHS CLAILEINEA
CNLTCPVCFA GSSTSLDAHR PLAEVERMLD VIVASEGEPD LVQLSGGEPT LHPQFFEILS
AARARPIRHL MINTNGLRIA REAGFAERLA TYMPRFEVYL QFDSLKRDAL MQLRGADLVK
VRTQALEALE RNNIPTTLVV TLKKGVNDDE IADIVRFALT WRCVRGVTFQ PIQDAGRNDG
FEAKDHRIVL TEVRRRIAEA GVFALEDLIP LPCNPDQICI GYGLRNGQSV APVTSLLPRE
LFVAAAPSTV TFEAYPELQK KVFDLLSLST AQADTSDKLA GLLCCLPEAV VPESLAYEHT
FRVVISQFLD RYNFDLGTVK RSCVHFVEPD GRIIPFDTYN TFYRPGAAGA GALARGQGRA
I
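
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own method: it uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons), and the names CODON_TABLE and translate are hypothetical. It assumes the input contains only A, C, G and T.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# at each of the three codon positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGGATGGGT CTATTACCGC GCCGATCTCC GGCGAGAGCG CCTCGCGAAA GGTCGCCCCC"
print(translate(first_row))  # MDGSITAPISGESASRKVAP
```

The output matches the first 20 residues of the protein sequence above. The final six bases of the gene, ATATGA, translate to I followed by a stop codon, which agrees with the last residue of the protein.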