Gene Caul_3483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3483 
Symbol 
ID5900938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3760585 
End bp3761946 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content58% 
IMG OID641563989 
Producthypothetical protein 
Protein accessionYP_001685108 
Protein GI167647445 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCTG AGAGGCAAGC CCTCTACGAC TTCATCAAGG CGCAGTTCCA TGACCTCGGA 
GCCGATGGTT TCCAAGAGTG GTGGCATAGC CTGGACGAGG AGACGTTCGA GCTTGTCGAG
CAGGCCCTTT CCGATCCCGG CTTCGGCCTA AACCCACACC AGATCATGCC CGATGGCGAA
TGGCGCATCT GGGCGCTTTT CATGGGCCGT GGTGGTGGCA AGACCTACGC GGCCAGCAAG
GGCTCAAACG TCCTAGCTGA AGAAGTCTTT CCAGGCGGTA CGGGCATCCT CGTCGGCGCG
ACCGTCAAGG ACGTGCGCGA CACCATGATC GAGGGCGAGA GCGGCATCAT CGCGACCGCG
CGTCCCGGCT TCGTCCCCCA CTACAACAAA CACGACAACG TCTTGATCTG GCCGAACGGC
TCCAAGGCGC TGATCCGTAC TGCAGACAAC CCCGAGGACA TTCGCGGTCC AACCGTGAAC
TGGGCCTGGG CCGACGAACT GGTCAAATGG CGTAGCGAAA AGTCATGGGA CAACCTCAAC
CGATGCGTCC GTAACCTCCA CGAAAACGGC ACGAAGATCA TCGTGACCAC GACGCCCAAG
AAGGCGAAGC AGTGGATCAA AGACATTGAG AGCCTGCCGG GCACAATCGT TTCCCGCGCA
TCTTCACTCG ATAACCCTCA CATGGATGCA GCGTATCTTG AGGGCATTCG ACGCGAAGCT
GAGACCGGAA GCGCACGCGC ACGCGAAGAG ATTTTCGGCG AGTGGATCGA GGGAGACGGC
GAGCTTTGGA CTGAGAAATC CATTGAGGAA ATGCGCCAAC GACCCAGCGT CTCGCTGGAG
GTCATGGCGA AGTCGATGGA CCGTCGATAC ATTAGCGTTG ACCCATCGTC AGGCAAGCAC
GACGAAACGG GTATCATGCT CATGGGTAAG AAGGCGGGCC GGGTTTACGT GCTCGCGGAC
TTCACGTCAG GGGGCAACAT CAACCAGTGG ACAGACGAGA TTGTTCAACT CGCAAAGTCC
TACCTACAAC CCGGCGACAT CATCCTCCTT GAGGTGAACA TGAACGCCGC CGCGCAGAAC
GTGTTGGAGC AAAAAGACCG CAGCCTTCGC ATCGTCCCTG TGACCGCAAC CCGCTCCAAA
TGGCACCGCG CGGAAGAGGC ATTTTCGCAC TGTCAGTCAG GCCATGTTGT GTTCTGGCAT
ACGCATCCGA AGTTGGAGCT ACAGCTTCGC GAGTGGGAAC CCGAAATGAA GAAATCGCCT
GACCGAGGCG ACGCATTTAC GCAGGGTGTC AACTACGCGA TGGGAACGCA TGGCAGGGGT
TTGAGCGTGC CATTCTTCAC CATCCAGGGG TTCAACCGCT GA
 
Protein sequence
MRAERQALYD FIKAQFHDLG ADGFQEWWHS LDEETFELVE QALSDPGFGL NPHQIMPDGE 
WRIWALFMGR GGGKTYAASK GSNVLAEEVF PGGTGILVGA TVKDVRDTMI EGESGIIATA
RPGFVPHYNK HDNVLIWPNG SKALIRTADN PEDIRGPTVN WAWADELVKW RSEKSWDNLN
RCVRNLHENG TKIIVTTTPK KAKQWIKDIE SLPGTIVSRA SSLDNPHMDA AYLEGIRREA
ETGSARAREE IFGEWIEGDG ELWTEKSIEE MRQRPSVSLE VMAKSMDRRY ISVDPSSGKH
DETGIMLMGK KAGRVYVLAD FTSGGNINQW TDEIVQLAKS YLQPGDIILL EVNMNAAAQN
VLEQKDRSLR IVPVTATRSK WHRAEEAFSH CQSGHVVFWH THPKLELQLR EWEPEMKKSP
DRGDAFTQGV NYAMGTHGRG LSVPFFTIQG FNR