Gene Caul_3206 details


Gene Information

Locus tag: Caul_3206
Symbol:
ID: 5900661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3465947
End bp: 3467374
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 71%
IMG OID: 641563711
Product: TAP domain-containing protein
Protein accession: YP_001684831
Protein GI: 167647168
COG category:
COG ID:
TIGRFAM ID:
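
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon. The variable and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3465947
END_BP = 3467374
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return (span_ok, frame_ok, protein_ok) for one CDS record.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive, so span = end - start + 1
    - the last codon is a stop codon and is not counted in the protein
    """
    span_bp = end_bp - start_bp + 1
    span_ok = span_bp == gene_length_bp
    # A complete CDS should contain a whole number of codons.
    frame_ok = gene_length_bp % 3 == 0
    # Number of codons minus the stop codon gives the residue count.
    protein_ok = gene_length_bp // 3 - 1 == protein_length_aa
    return span_ok, frame_ok, protein_ok


result = check_lengths(START_BP, END_BP, GENE_LENGTH_BP, PROTEIN_LENGTH_AA)
print(result)
```

Under these assumptions all three checks agree with the record: the span is 1428 bp, which is 476 codons, or 475 residues plus one stop codon.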


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.812713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGGAT GGATGCTGGC GGCCGGATTG GCGGCGGGAA CCTGGCTGGC GGCGACCAGC 
GCCCAGGCCG CCGAGCCGAA ATTCACCCCC GGCCCCTGCG CCGGCGACTA CAGCGGCGTC
AGCAACAGGA TCGAGTGCGG GACGCTGGTG GTCGACGAGA CCCGCGGCGG ACCCAGCACG
CGGCGCGTCG CCCTGCCGGT GACCATCGTC AAGGCCAGCG CCCCCAAGCC CGGCGCCGTG
CCGGTCATCT ATCTGCACGG CGGTCCCGGC GGCGGCGTGG TCGAGGCCCT CGGCCGATCC
TTGCGCGGCG CGGCCGGCCG GGAACTGATC GCCATCGACC AGGACTGGAT CTATTTCGAC
CAGCGCGGCG GCGGTGTGGC CTCGCCGATC CTCGACTGCG GCGCGGTGGC CCTGAACGAC
GCCGGCCCGC TGAACGACGC CGCCGCCCAG CAACTGATCG CTTGCGGTCG GCGGCTGAAA
GCCTCGGGCG TCGACCTGTC GCGCTACAAC GCCGAGGAGG TGGCCAAGGA CATCCAGGAC
CTGCGCAAGA CCCTGGGCCT CAAGCAGATC GACCTGTTCG GCGTGTCCTA CGGCACCCGC
ATCGCCCTGG CCGTGGTCAA GCATCAGCCG CGAGGCGTCC GCGCCGTGGT CCTCGACTCG
CCCTGGACGC CGGAGGCCAA GTGGGCCGAG GGCGGACCGG AGATGGTGTC GGACGCCGTG
AAGGAGATCT TCAAGCGCTG CGCGGCCGAC GCCGCCTGCA ACGCCAAATA TCCCCATCCC
GCCGCCGACC TCGACGCCGT CGCCGACACG CTGCTGAGCG GCCCGCAAGA GATCGGCGGC
AAGGTCTACG CCGCCGACGA CCTGGGCGGC TTCCTGATGG ACGCGGCCTA TAGCGGCCCC
GACGCCCGCG CTTTGCCCGC CACGGTGGCC AGGTTCGCGG CCGGCGACAT GACCGCCCTG
GCCCAACAGA TGGAGGGTCG CAGCGGCTAC AACGAGGCCC AGCACCTGAC TCATCTGTGC
AAGGAGGAGT TCCCGTTCGA GAGCGAGGCG GCGATGCGCA AGGGGGCTGG GCGCGACTCC
GTTTCGCGGC TGCTGGAGGC CTCGATGGGT CGCTACTTCC AGGTCTGCAA GGCCTATGAT
GTCGGCGCCC CCGATCCGGT CGAGGCCCTG CCGGTCAGCA GCGCCATCCC AACCCTGTTC
CTGGCCGCCG AGATCGATCC CGGCTGCCCG CCGGCCGTCG CCAAGGCGGC GGTGGGCCGG
TTCGCCAAGG GCCAGCTGAC CATCATCCCC AACACCACCC ACGGCGTGTC GCGCGGCAGC
GCCTGCGCCC GCAAGATGAT CCGCGCCTTC CTGGCCGACC CAACCGCGCC GATCGACCAG
AGCTGCCTGC ACCCCGAGCA CGACAAGTTC GTGTTCGATT TGGACTAG
 
Protein sequence
MRGWMLAAGL AAGTWLAATS AQAAEPKFTP GPCAGDYSGV SNRIECGTLV VDETRGGPST 
RRVALPVTIV KASAPKPGAV PVIYLHGGPG GGVVEALGRS LRGAAGRELI AIDQDWIYFD
QRGGGVASPI LDCGAVALND AGPLNDAAAQ QLIACGRRLK ASGVDLSRYN AEEVAKDIQD
LRKTLGLKQI DLFGVSYGTR IALAVVKHQP RGVRAVVLDS PWTPEAKWAE GGPEMVSDAV
KEIFKRCAAD AACNAKYPHP AADLDAVADT LLSGPQEIGG KVYAADDLGG FLMDAAYSGP
DARALPATVA RFAAGDMTAL AQQMEGRSGY NEAQHLTHLC KEEFPFESEA AMRKGAGRDS
VSRLLEASMG RYFQVCKAYD VGAPDPVEAL PVSSAIPTLF LAAEIDPGCP PAVAKAAVGR
FAKGQLTIIP NTTHGVSRGS ACARKMIRAF LADPTAPIDQ SCLHPEHDKF VFDLD
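
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the method used to produce this record. It assumes the sequence is read in frame from its first base, strips the spacing used in the 10-base blocks above, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The names `translate`, `FIRST_LINE`, and `LAST_LINE` are hypothetical; the two test strings are the first and last lines of the gene sequence above.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in directly. Trailing bases that do not fill
    a complete codon are ignored.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence in this record.
FIRST_LINE = "ATGCGTGGAT GGATGCTGGC GGCCGGATTG GCGGCGGGAA CCTGGCTGGC GGCGACCAGC"
LAST_LINE = "AGCTGCCTGC ACCCCGAGCA CGACAAGTTC GTGTTCGATT TGGACTAG"

print(translate(FIRST_LINE))
print(translate(LAST_LINE))
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own, as done here for the first and last lines.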