Gene Caul_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0866 
Symbol 
ID5898321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp921034 
End bp922839 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content76% 
IMG OID641561349 
ProducttRNA U-34 5-methylaminomethyl-2-thiouridine biosynthesis protein MnmC 
Protein accessionYP_001682495 
Protein GI167644832 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating)
[COG4121] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03197] tRNA U-34 5-methylaminomethyl-2-thiouridine biosynthesis protein MnmC, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.205507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC CCACCGCCTC CCCCCTGATC TGGCGCGACG ACGGCATGCC CCAGTCGGCG 
CTGTACGGCG ACGTCTATTT CTCCAGCGCC GACGGCCTGG CCGAGACCCG CGCGGTCTTC
CTGGCCGGCT GCGACCTGCC GGCCGCCTGG GCGGGCCGCG ACCATTTCAC CGTCGGCGAA
CTGGGCTTCG GCACCGGCCT GAACATCGCC GCCCTGCTCG ACCTCTGGCG GCGCGAGAAG
GCGCCCCACT CCATGCAGGG GGGGCAGCGC CTGCACATCT TCTCGATCGA AGCCCATCCG
ATCACCCGCG ACGAGGCGGC GCGCGCGCTC GCCGTGTGGC CGGAGCTGGG CGAGGCGGCG
AGCGTGCTGC TCGACCACTG GCCAGGCCTG GCGCGCGGCT TCCATCGCAT CGACCTGCCG
GGCTTCGACG CCACCTTCGA CCTGGCGGTG ATGGACGTCG AGCCGGCCCT GGCGGCCTGG
GACGGCGCGG CGGACGCCTG GTTCCTCGAC GGCTTCTCCC CGGCCCTCAA CCCGGCCATG
TGGCGCGAGG AGATCCTGGC CGCCGTGGCC GCCAGAAGCG CTCCCGGCGC GCGCGCCGCC
ACCTTCACCG TCGCCGGCGC AGTGCGGCGC GGCCTGTCCG CCGCCGGCTT CCAGGTCGAC
AGGCGCCCGG GCTTCGGCCG CAAGAAACAG CGGCTGGAGG CGGTGGCGCC CGGCGTCGCG
GCGTCGCCCC CGCGACCGCG CCGCCTGGCG GTGATCGGCG GCGGCATCGC CGGCGCCGCC
ATGGCCCGCG CCGCTCGCGC CGAGGGCCTG GAGGCGATGA TGTTCGACGA CGGGCAGGCG
CCGGCCTCGG GCAATCCCGC CGCCCTGGTC ACGCCCGCCC TGGACGCTGG CGGCGGCCCC
CGCGCCGCCC TGCCCGCCCA GGCCTTCGCC CGCGCCGCGG CCCTCTACGA AGCCCTGCCC
GAGGCGGTGA TCGCCCGCGG GGCGCTGAAA CTGTCGGTCG TGCCGCGCGA CGAGGCCCGC
CATGCCGCCG TCGCCGACCA GGACCTGTTC GAACCCGGAA CCATGGCGGT GCTCGACGCC
GCGACCGCCA CGGCGCGACT GGGCGAGCCG GTCGGAGAAG CCCTGTCGAT GTCGCAGGCC
CTGGTGGTGG AGCCGGCCCG GGTGCTCGAC GCCTGGCGGG GCGAGGTGAT CGACGCCCAG
GTCGCCCGGC TGGCGCACGA AGATGGGGTC TGGCGCCTGC TGGGATCTGA CGACCAGCTC
CTGGCCGAGG TCGACGCCGT GGTCCTGGCG GGCGGCGCCG GCCAGGCGCG GCTCTGGCCC
GACGCCCCGC TGCGGCCGAT TCGCGGCCAG ACCAGCTGGA CGGACCGACC GCTCGCCTTT
CCGACGCCCG CCGCCTTTGG CGGCTATGTC GCGCCCACCC GCGACGGGAT GCTGTTTGGG
GCCACCCACG ATCGTGACGA CGTGGGGACC GACGCTCGCG CCGAAGATGA CCGCCGCAAT
CTGCGGGCCT TGGCCGAGGG CCTGCCCAAG CTCGCGGCCA GCCTGGCGGA CGCGCCGCTG
CGGGGCCGGG CCGCCGTTCG CGCCACGACC GCCGACCACC TGCCGGTGGC CGGCGCGGTT
CCCGGGGCCG CGCCGGGACT GTTCGTGCTA GGCGGCCTGG GCGGCCGCGG TTTCTGCCTG
GCGCCCCTGC TGGCCGAGCA CCTGGCGGCC CGGATTCTCG CCCTGCCCTC GCCCTTGCCC
CGCCCCCTGT CGGCCCTGGT CGAACCCGGG CGATTTTCGT CGCGCGTCGC GACCGGCGCG
GTATAG
 
Protein sequence
MSDPTASPLI WRDDGMPQSA LYGDVYFSSA DGLAETRAVF LAGCDLPAAW AGRDHFTVGE 
LGFGTGLNIA ALLDLWRREK APHSMQGGQR LHIFSIEAHP ITRDEAARAL AVWPELGEAA
SVLLDHWPGL ARGFHRIDLP GFDATFDLAV MDVEPALAAW DGAADAWFLD GFSPALNPAM
WREEILAAVA ARSAPGARAA TFTVAGAVRR GLSAAGFQVD RRPGFGRKKQ RLEAVAPGVA
ASPPRPRRLA VIGGGIAGAA MARAARAEGL EAMMFDDGQA PASGNPAALV TPALDAGGGP
RAALPAQAFA RAAALYEALP EAVIARGALK LSVVPRDEAR HAAVADQDLF EPGTMAVLDA
ATATARLGEP VGEALSMSQA LVVEPARVLD AWRGEVIDAQ VARLAHEDGV WRLLGSDDQL
LAEVDAVVLA GGAGQARLWP DAPLRPIRGQ TSWTDRPLAF PTPAAFGGYV APTRDGMLFG
ATHDRDDVGT DARAEDDRRN LRALAEGLPK LAASLADAPL RGRAAVRATT ADHLPVAGAV
PGAAPGLFVL GGLGGRGFCL APLLAEHLAA RILALPSPLP RPLSALVEPG RFSSRVATGA
V