Gene Caul_2446 details


Gene Information

Locus tag: Caul_2446
Symbol:
ID: 5899901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2668431
End bp: 2669681
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 70%
IMG OID: 641562937
Product: hypothetical protein
Protein accession: YP_001684071
Protein GI: 167646408
COG category: [S] Function unknown
COG ID: [COG3174] Predicted membrane protein
TIGRFAM ID:
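
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2668431, 2669681)
length_aa = protein_length(length_bp)
print(length_bp)  # 1251
print(length_aa)  # 416
```

Both values match the Gene Length and Protein Length fields reported for Caul_2446.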


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.385923
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.422644
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCGG AACCCTTCGA ACGGCTTGGA CTGGCCTTGG CCATCGGTTT CCTGATCGGC 
ATCGAACGCG GCTGGCGCGC GCGCGAGGTC GCCGAAGGAG GACGCGCCGC GGGGTTGCGC
ACCTACGCCC TGTCGGGTCT GCTCGGCGGG GTCTCGGGGA TGCTGAGCCA AGCCCTGGGC
GGCTGGGCCA TGCTGACGCT CGGACTGCCG TTCGCCGCGG CCTTCATCCT CTTCAAACGG
CAGGAACAGC GAGACGACAA CGACTATTCC GTCACCGCCA TCGTCGCGGG CCTGCTGACC
TTTGGTCTCG GGGCGCTGGC GGCGATCGGC GATGAGCGGG TCGCCGCCGC CGCCGCCGTG
GCGGTCGCGG CTCTACTGGC GGGCAAGGAG GTGTTGCATA CCTGGCTCAA GCGCCTGACC
TGGCCCGAAC TCCGAGACGC GCTGGTCCTG CTCGCCATGA CCTTCGTGGC CCTGCCGCTT
TTGCCAAACC GTCCCTTTGG ACCCTACGGC CTGGTCAATT TCGCCGAACT CTGGGTGCTG
ACCATCGCCA TGGCGGGGAT TTCCTTTGTC GCCTACGCCG CGATCAAGGT GTGGGGGCCG
GCGCGTGGCG CCTTGCTGGC CAGCGCGGCT GGCGCGCTGG TCTCTTCGAC GGCGGTGACC
TTCTATCTGG CTCGACTGCA GAAGACGGTC TCCAACCCCC TTGCCCTGGC GGGCGCGGCT
CAGGTGGCCA GCGCCGTCAT GGCGATCCGG CTGGGCGCCA TAACCTTGGC GCTTTGGCCA
CCCCTGTTCT GGTCGCTGGC CGCGCCGCTC GGGGTGTTCG CCGCCCTCTC GACGATCTTT
GGCCTAGGGG CGACCGCCTT CGCATCGACC AGGGACGCCT CGCCGTCGCC ATCCTCGGCC
AAGAGTCCAT TTGAACTGGC GCTGGTGTTG AAATTCGCGC TGGCGCTGGG CGTCATCATG
GCGGCGGCCA GGGTCGCCGC CGGCCTCTAC GGTCCGTCCG GCCTCCTGCC CGTGGCGGCG
CTTGGCGGCC TGGTCGACGC CGACGCGGTC ACTCTGGCGG CCGCCCGCAT GACCTCGAAA
GGCATGGCCA TCGGCATCGC CGGCCAGGCG GTGCTCCTGG CCGCGGCGGT CGACAGCGTC
TCAAAGATGG TCATCGCCTG CGCGGTGGGC GGCTCGCGCT TTGGGGCGCT CTATTCGGCG
GGCACCCTGT TGGCGCTGGG CGCCGCGGCC GGCGCCTGGG CATGGGGCTA G
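
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. As a rough sketch, the snippet below strips the layout whitespace and computes a GC fraction of the kind summarized in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCGGCGG AACCCTTCGA ACGGCTTGGA CTGGCCTTGG CCATCGGTTT CCTGATCGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on this line
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full 1251 bp sequence through the same two functions would give the gene-level figure.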
 
Protein sequence
MAAEPFERLG LALAIGFLIG IERGWRAREV AEGGRAAGLR TYALSGLLGG VSGMLSQALG 
GWAMLTLGLP FAAAFILFKR QEQRDDNDYS VTAIVAGLLT FGLGALAAIG DERVAAAAAV
AVAALLAGKE VLHTWLKRLT WPELRDALVL LAMTFVALPL LPNRPFGPYG LVNFAELWVL
TIAMAGISFV AYAAIKVWGP ARGALLASAA GALVSSTAVT FYLARLQKTV SNPLALAGAA
QVASAVMAIR LGAITLALWP PLFWSLAAPL GVFAALSTIF GLGATAFAST RDASPSPSSA
KSPFELALVL KFALALGVIM AAARVAAGLY GPSGLLPVAA LGGLVDADAV TLAAARMTSK
GMAIGIAGQA VLLAAAVDSV SKMVIACAVG GSRFGALYSA GTLLALGAAA GAWAWG
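
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI-style amino acid string, whose codon assignments are shared by the standard code and translation table 11 (the two differ in their permitted start codons). The function name is hypothetical, and the inputs are the first 30 bases and the last 51 bases of the gene sequence with spaces removed.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGCGGCGGAACCCTTCGAACGGCTTGGA"))  # MAAEPFERLG
# Last 51 bases (the final sequence line), ending in the TAG stop codon.
print(translate("GGCACCCTGTTGGCGCTGGGCGCCGCGGCCGGCGCCTGGGCATGGGGCTAG"))  # GTLLALGAAAGAWAWG
```

The two outputs match the first 10 and last 16 residues of the protein sequence listed above.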