Gene Caul_2601 details


Gene Information

Locus tag: Caul_2601
Symbol:
ID: 5900056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2821740
End bp: 2822855
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 71%
IMG OID: 641563092
Product: hypothetical protein
Protein accession: YP_001684226
Protein GI: 167646563
COG category: [S] Function unknown
COG ID: [COG4129] Predicted membrane protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.220305
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAG CCGCGCCCAG CGACTGGCGG CGTTTCGTCG GCCTGGAGTT TTCCCGGGCG 
GCCCTGGCCA GGGCCGCCAC GCGCAAGACC GAGTTGCGCC ACGCCGTGCG GGTGTCGGCG
GCCGTCGGGG CGGCCTACGC CGTGGCGAGC CTGCTGCGCC TGCCCCAGGG CTATTGGGCG
GTGTTCACCG CCGTGATCGT CGTCCAGTCG AGCCTCGGCG CGACGATCAC CGCCTCGATC
GAGCGTTTCA TGGGCACGGT CGTGGGCGCG CTGGCCGGAG CCGGCGCGGC CATGCTGCAC
GCCCGCTGGC CCGAGGCGGG CGGACCGATC CTGGTGGTCA CCGCCGCCTT GCTGGCCTTC
CTGGCCGCCG TGCGCCCGGC CTTCAAGGTG GCGCCGGTCA CCGCCGTGAT CATGCTGATC
GGCACGACCA CCCACATGGA CCCGTTGGTG GCCGCCTTCC TACGAACGGC CGAGATCACG
GTGGGCAGCG TCGTCGGCAT CGCCGCCACC TTGCTGATCT TTCCGGCCCG CGCCCACGCC
TCGGTGGTCG CCAGCACCGA GCAGGTCGCG GGCCTGCTGG CCGACCTGCT GGAGCACTAC
GCCCTGAAAC TGAAGGGCGG CGCGACCGAG CTCGAGGCCC GCGACCACTA CGACGACACG
CTCAAGGCCC TGTCCAAGCT GCAGACCGCC ATGACCGAGG CCGACCGCGA GAGCGCCAGC
AAGTTGAGCG ATCATTCGGT GACGGACGCC CTGCCTCGCA CCCTTTGGAG GCTGCGCAAC
GACTGCGTGA TGATCGGCCG GGCCCTGCGC GAAGCCTTTC CCTCGCCCGG CCTGACTTTG
CAGTCGGCGG CGATGCTCAA CGCCAGCGCC GTTTTCCTGC GGACCAGCGC GGCCCTGCTG
TCGGGCGGTC CACGCCCCGA CCGCATCACC TTCGCCCAGG CCCATCAAGC CTTTCAGGCG
GCGGTGGAGG CCCTGCGGGC CGGCGGCGGC ACCCGCGCCC TGGCCTTCGA TGACGCGGCC
CGGGTGTTTG GCTTGGTGTT CGCGGTCGAG AACCTGTTCG GCAATCTCGG GGACTTCGAG
GAACGGGTCG AGGAGACGGT GGGCAAGCGG GGCTGA
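
The gene length and GC content listed under Gene Information can be checked against the sequence itself. The following is a minimal sketch, not the method used to produce this record: the helper names (clean, gc_content, expected_protein_length) are hypothetical, and only the first row of the gene sequence is embedded as sample input. It assumes the sequence is printed in space-separated blocks of ten bases, as it is above.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the gene minus one, since the stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCCGGAAG CCGCGCCCAG CGACTGGCGG CGTTTCGTCG GCCTGGAGTT TTCCCGGGCG"

print(len(clean(first_row)))           # bases in the sample row
print(f"{gc_content(first_row):.0%}")  # GC content of the sample row only
print(expected_protein_length(1116))   # compare with the listed Protein Length
```

Passing the full gene sequence instead of first_row gives values to compare with the Gene Length and GC content fields.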
 
Protein sequence
MPEAAPSDWR RFVGLEFSRA ALARAATRKT ELRHAVRVSA AVGAAYAVAS LLRLPQGYWA 
VFTAVIVVQS SLGATITASI ERFMGTVVGA LAGAGAAMLH ARWPEAGGPI LVVTAALLAF
LAAVRPAFKV APVTAVIMLI GTTTHMDPLV AAFLRTAEIT VGSVVGIAAT LLIFPARAHA
SVVASTEQVA GLLADLLEHY ALKLKGGATE LEARDHYDDT LKALSKLQTA MTEADRESAS
KLSDHSVTDA LPRTLWRLRN DCVMIGRALR EAFPSPGLTL QSAAMLNASA VFLRTSAALL
SGGPRPDRIT FAQAHQAFQA AVEALRAGGG TRALAFDDAA RVFGLVFAVE NLFGNLGDFE
ERVEETVGKR G
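
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first and last rows of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code for ordinary codons (table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG). The names CODON_TABLE and translate are illustrative and do not come from the source database.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with their
# amino acids; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGCCGGAAG CCGCGCCCAG CGACTGGCGG CGTTTCGTCG GCCTGGAGTT TTCCCGGGCG"
last_row = "GAACGGGTCG AGGAGACGGT GGGCAAGCGG GGCTGA"

print(translate(first_row))  # MPEAAPSDWRRFVGLEFSRA
print(translate(last_row))   # ERVEETVGKRG
```

The two outputs match the first twenty and last eleven residues of the protein sequence above. The last row is read in frame because the 1080 bases before it are a multiple of three.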