Gene Caul_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1999 
Symbol 
ID5899454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2145364 
End bp2146776 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content63% 
IMG OID641562488 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001683625 
Protein GI167645962 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCTA CGCTCCGAAT GACCGGGCGC CATGCTCGAC GCCTTCATGC TCACCTCTTC 
CCGGGGGATG GCAAGGAAGC CGTGGCCATT GCTCTGTGCG GCCGGCGCAG GGGAGCCTAC
GAGCGCCTGC TCGTGCACAA GCTGGTTCTT ATTCCCCACG GGGACTGCGA CCTGCGCACG
CCGCTCACTG TGGCTTGGTC GACCGACTTG ATCGTGCCGG CCCTCGAAGA AGCCGAGCGC
CGCGGGTGGA GTGTGGTGAA GTTCCATAGT CACCCTGGTG GCTACAACAT GTTCTCGGAT
CAAGACGACC TCTCCGATGG CCTGCTTTTT CCGGCTATTC ATGGATGGGT CGAGCACGAG
GTGGCCCACG CCAGCGTCGT AATGCTGCCT GACGGAGCGA TGTTTGGCCG GACGGTCGAC
GCTCAGGGCG TCTTCTCGCC CCTGGAAGCT ATCGTCGTGG CCGGCGAGCG GATCGAGATT
TGGCGCCATA GTGAAGTAAC GGGCGAGGGT GTCATCGCGC CCCTGCCGGA TTTCGCCAAG
CGTCATGCCC AAGCGTTCGG CGTGCGCACC ACCCGGCGGC TCAGCCACCT CTCGGTCGCC
GTGGTCGGCT GCTCGGGCAC AGGCAGCATC GTGATCGAAC AGCTCTACCG GCTGGGTGTT
GGGCGATTGG TGATCGTCGA CCCCGACGTG GTCAAGGACA TCAACCTCAA TCGGATTTTG
AACACCACGT CTGCCGACGC GGCGGCTGCG CGCGCCAAGG TCGAGGTGCT GCATGACACC
ATCGTGCGTA CCGGGCTCGG GACCGACGTT CTGCCGATCG CCAAGAGCCT GTTCGACCCC
GAGGCGATCG CCGCCGTGGC CGACTGCGAT CTGGTCTTTG GATGTGTGGA TTCCGCCGAG
GCGCGGTTCC TGATCAATCG TATCACCGCA TTTTACGTGA TGCCGTACTT CGACGTCGGC
GTTGCACTCG ACGCCGACCA GGCCGGACGG ATCACCCAGG TTTGCGGCTA TCTGCATTAC
GTACAGCCTG ACCAATCGAG CATGGTCAGC CGCGGTGCAA TTTCGATGGA GGAGGTGCGG
GCCGAGGGCG AGAAGCGCCG CAATCCCGAG CACTACGCAA ATCTGCGGCA GGCCGGGTAC
ATCCAGAATG TCGACGAAGA CCGGCCGGCG GTCATCAGCG TCAACACCGT GTTCTCAGGT
CTGATCGTCA ACGAGTTTTT AGCGCGTCTT CACGATTTCC GGGACGATCC GGGCGACGCC
TACGCCACGA TTGGCTTCAG CTTGAGCCAG ATGATGTTCT ATCCCGAGGC CGAAAGCGGC
ATGCCGTGCC GCGTGTTCTC GCCTCACGTT GGGCGGGGCG ATACGCGCTT GCTGCTCGAC
ATGCCTGAGT TCAGCCTGGG GCAGCGCTCG TGA
 
Protein sequence
MAATLRMTGR HARRLHAHLF PGDGKEAVAI ALCGRRRGAY ERLLVHKLVL IPHGDCDLRT 
PLTVAWSTDL IVPALEEAER RGWSVVKFHS HPGGYNMFSD QDDLSDGLLF PAIHGWVEHE
VAHASVVMLP DGAMFGRTVD AQGVFSPLEA IVVAGERIEI WRHSEVTGEG VIAPLPDFAK
RHAQAFGVRT TRRLSHLSVA VVGCSGTGSI VIEQLYRLGV GRLVIVDPDV VKDINLNRIL
NTTSADAAAA RAKVEVLHDT IVRTGLGTDV LPIAKSLFDP EAIAAVADCD LVFGCVDSAE
ARFLINRITA FYVMPYFDVG VALDADQAGR ITQVCGYLHY VQPDQSSMVS RGAISMEEVR
AEGEKRRNPE HYANLRQAGY IQNVDEDRPA VISVNTVFSG LIVNEFLARL HDFRDDPGDA
YATIGFSLSQ MMFYPEAESG MPCRVFSPHV GRGDTRLLLD MPEFSLGQRS