Gene Caul_3709 details

Gene Information

Locus tag: Caul_3709
Symbol:
ID: 5901165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4004649
End bp: 4005833
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 66%
IMG OID: 641564220
Product: hypothetical protein
Protein accession: YP_001685334
Protein GI: 167647671
COG category: [S] Function unknown
COG ID: [COG3748] Predicted membrane protein
TIGRFAM ID:
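
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `check_lengths` is hypothetical and not part of any database tool.

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) from coordinates.

    Assumes 1-based inclusive coordinates, an unspliced CDS, and one
    terminal stop codon that is not counted as a residue.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the Gene Information fields above.
gene_len, prot_len = check_lengths(4004649, 4005833)
print(gene_len, prot_len)  # 1185 394
```

Under these assumptions the computed values agree with the listed Gene Length (1185 bp) and Protein Length (394 aa).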


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.191189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTATG ACGTCACCAC CTGGCTGAAC CTGGCCCTGC GCTGGCTGCA CGTGATCGCC 
GGCGTCGCCT GGATCGGCGC CTCGTTCTAC TTCGTCTGGC TGGACAACAA CCTGCGCGCC
CCCGAGCCGC CCAAGGACGG CGTGAAGGGC GAGCTATGGG CCGTGCATGG CGGCGGCTTC
TACCATTCGC AGAAGTACAT GACGGCCCCG GCCCACATGC CCGACCATCT GCACTGGTTC
AAATGGGAGG CCTACACCAC CTGGCTCAGC GGCTTCGCCC TGCTGATCGT GCTCTACTAT
GTCGGCGCGC CGGTCTATCT GATCGACGCC TCCAAGCACG CCTTCAGCCA GCCCGGGGCC
ATCGCCACGG GCCTGGCCTT CATCTTCGGC GGCCTGGCCG TCTACGAGGC CCTGTGCCGT
TCGCCGCTGG GGGGGAAGCC CCGCCTGTTT GGCCTGGTCT GGTTCCTGGC CCTGACCGGC
GCGGCCTATG CCCTGACCCA TCTCTTCAGC GATCGGGGCG CCTTCATCCA TGTCGGCGCG
ATCATCGGCA CGGCCATGGT CGGCAACGTG TTCCTGGTGA TCATCCCCAA CCAGCGCAAG
ATCGTCGCCG ACATGCTGGC CGGCCGCAAG GTCGATCCGC GCCTGGGCGC GATGGGCAAG
CAGCGCTCGG TGCACAACAA CTACATGACC CTCCCGGTCA TCTTCATCAT GATCAGCAAC
CACTATCCGG TGGTGACAGG TCACCAGATG GCCTGGCTGC TGCTGGCGAT GATTAGCCTG
GGCGGGGTGT CGATCCGCCA CTTCTTCAAC CTGCGCCACC ACGGGATCAT CAAGCCCGAC
TTCCTGTTCA TCGGGGCGAT GCTGGTGTTC GCCGTCAGCC TGATCGCCAG CTCCAGGCCC
AAGCCCGCCG AGACCGTCTC GAACGTGCCC TTCCCGGTCG CCCTGGCGAT CGTCCAGAAG
CACTGCGTGA TGTGTCACGC GGCCGTTCCG ACCCACAAGG GCTTCACCGC CCCGCCGAAC
GGCGCGGCCT TCGACACGCC CGAAGGCCTG GCCCGCTATG CGCCCAAGAT CCGTGAACGG
GCGGTCGAGA CCACCAGCAT GCCCCTTGGA AACGAGACTC ACATTACCGA TCAGGAACGC
GCCCAGCTGG GCGCCTGGAT TGAGGCGGGA GCGAAGACGA AGTGA
 
Protein sequence
MDYDVTTWLN LALRWLHVIA GVAWIGASFY FVWLDNNLRA PEPPKDGVKG ELWAVHGGGF 
YHSQKYMTAP AHMPDHLHWF KWEAYTTWLS GFALLIVLYY VGAPVYLIDA SKHAFSQPGA
IATGLAFIFG GLAVYEALCR SPLGGKPRLF GLVWFLALTG AAYALTHLFS DRGAFIHVGA
IIGTAMVGNV FLVIIPNQRK IVADMLAGRK VDPRLGAMGK QRSVHNNYMT LPVIFIMISN
HYPVVTGHQM AWLLLAMISL GGVSIRHFFN LRHHGIIKPD FLFIGAMLVF AVSLIASSRP
KPAETVSNVP FPVALAIVQK HCVMCHAAVP THKGFTAPPN GAAFDTPEGL ARYAPKIRER
AVETTSMPLG NETHITDQER AQLGAWIEAG AKTK
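
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and compute a GC fraction to compare with the listed GC content; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACTATG ACGTCACCAC CTGGCTGAAC CTGGCCCTGC GCTGGCTGCA CGTGATCGCC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence block to `clean_sequence` instead of a single line gives the value to compare against the GC content field.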