Gene Caul_4800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4800 
Symbol 
ID5902262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5190292 
End bp5191395 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content64% 
IMG OID641565320 
Producthypothetical protein 
Protein accessionYP_001686418 
Protein GI167648755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.351795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGG CTGAACGACT TAAGCGTCTG ATCGCGGGGC ACGATGATCG GGCTCTGTTG 
TGGGCGGCGG TGGCCGTCGG CGCGGTGGCC CACCTTGCCG CCCTGATCCT GTTCGGAAAG
GTCGGCCCCA AGGCCAATAT CTGGGAGTAT GGCGTCCAGG CCCAATGCGC GCTGCGCACG
GGGGGAGACC TTTGCCAGTA CTACTTCAGC GGCCCACAGG GCAGCTATCC GTCAGCCTAT
ATGCCGCCCT TCCTCAGCTA CATGTGGCTG GCGTTGTTCA AGATGTTTGG CGACACCGCC
GCCGCCCGCA TCGTCTGGCT GGCGATCAAC TGGTCGATTT CGTTGATCAA CATCGGCCTG
CTGTTCCAGT TGGGCCGGCG TTGGAAGCTG TCGCCGGTGG CCTGTTTCCT CGCGGCCGTG
ACGCTGGCGC TCTATCCGAC CTTCGTGTTC GTGGTGGCGA CCTATCACCA GACCGAATGG
ACGGTGATGT TCCTGCTGCT GCTGGCCTTG CTCGGCAGCA CGGTCCTGCA ATCCGCGGAG
TCGCCGCTCA AGGCGGTGGT GTGGATGGGC GTGGTCAGCG GCTTCGCCAC GCTCAACCGC
TCGGAGATGA TCATCATCGG CCCGGCCATG ATCGTCCTGG TCTGCGCGCT GCGCCGACAG
GTCTTGCCGG TGGTCGCCGC CGCCGTGGCC ATGATCCTGG TCCTGGCGCC CTGGACGGTG
CGCAACTACC AGCTGTTCCA CCGCGTGGTG CCGGTCGCCC AAAGCGCCGG CTACAATCTG
TGGAAGGGCT TCAATCCGTA TACCAACGGC TCGGGCAACA TGACCGAAAT GCCGGGCGGA
CCGGGTGATC GCAAGCTGTC TGAGATCCGC GACCGCATTC CCCACGGCCC GATGTACGAA
CCGGCTCTGC AAGATGCCTA CAAGGAGCAG TTCAAGCACG ATCTGGCCGC GGCCGGTCCC
GTTCGGTTGG CCCAGCTGGT GATCACCAAG ACCGCGCTTC TCTGGGGCTT TGACTGGACT
GACCGCGAGA TCACCGCCCG GCCGCTCTAC CGTCTGCCCT GGCTCGTCGC CAACGCCTGG
CGCTGTATGG CCTGGTGCTG GTGA
 
Protein sequence
MALAERLKRL IAGHDDRALL WAAVAVGAVA HLAALILFGK VGPKANIWEY GVQAQCALRT 
GGDLCQYYFS GPQGSYPSAY MPPFLSYMWL ALFKMFGDTA AARIVWLAIN WSISLINIGL
LFQLGRRWKL SPVACFLAAV TLALYPTFVF VVATYHQTEW TVMFLLLLAL LGSTVLQSAE
SPLKAVVWMG VVSGFATLNR SEMIIIGPAM IVLVCALRRQ VLPVVAAAVA MILVLAPWTV
RNYQLFHRVV PVAQSAGYNL WKGFNPYTNG SGNMTEMPGG PGDRKLSEIR DRIPHGPMYE
PALQDAYKEQ FKHDLAAAGP VRLAQLVITK TALLWGFDWT DREITARPLY RLPWLVANAW
RCMAWCW