Gene Caul_3306 details

Gene Information

Locus tag: Caul_3306
Symbol:
ID: 5900761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3582394
End bp: 3583818
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 73%
IMG OID: 641563812
Product: RND efflux system outer membrane lipoprotein
Protein accession: YP_001684931
Protein GI: 167647268
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family
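
The start and end coordinates, gene length, and protein length listed above agree with one another, and the arithmetic can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3582394, 3583818)
print(length)                  # 1425
print(protein_length(length))  # 474
```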


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCC TCAAGCACCT TGCCGTCGCC GGCTCCGCCC TGGCCCTGCT GTCGGCCTGC 
GCGGCCGGTC CGGCCTACGA GAAGCCGGAG GTCTCGGCGA CGAGCCTGCC CGCCACCTAC
AAGGCGCTCG ACGGCTGGAA GCTCGCCACG CCGGGCGACC TGACCGACCG CGGCGACTGG
TGGACCTTGC TGGGCGATCC CCAACTCGAC GCCCTGATCG GCAGGGTCAA CGTCTCCAAC
CAGACCATCG CCGCCGCCGA GGCCAACTAC CGCCAGGCCC GCGCCATCGT CCGCGAACAG
CGCGCCAGCC TGTTCCCGAC CGTCGACCTG TCGGGCTCGG CGACCAAGTC AGGCGGTTCG
GGCTCGACCG GATCTGGCGC CGGCGCCTCA AGCAGCGGTC GGCGCTATCA GGTCGGCATC
GGCGCCAGCT GGGAGCCGGA CCTCTGGGGC CGCGTCCGCG CCGGCGTCAG CGGGGCCAAG
GCCAACGCCC AGGCCAGCCA GGCCGATCTG GCGGGCGCCC GGTTGTCGAT GCAGGGCGAG
CTGGCCGTGA ACTATCTGGG CCTGCGCCAG ACCGACGCCG AGATCGCCCT GGTCGCCAAG
ACGGTGGAGG GCTACCAGCG CAGCCTGACC ATCACCCAGA ACCGCTACGC CGCCGCCATT
GCGCCCAAGT CGGACGTGCT GCAGGCCACC ACCCAGCTGG CTGGAGCCCA GGCCGATCTG
GAGAGTCTGC GACAGACCCG GGCGACCTAC GAGAACGCCA TCGCCACCCT GGTCGGCGAA
CCGGCCAGCG GCTTCAAGCT GGCCGCCGAT CCGGCCTGGA GCGCCGGCGT GCCCGAGATC
CCGGCCGGCG TGCCCTCGAC CCTGCTGGAG CGCCGCCCCG ACATCGCCGC CGCCGAGCGC
CGCGTGGCCG CCGCCAACGC CGACATCGGG GTGGCCCGCG CCGCCTTCTT CCCGACGTTT
GGCCTCAGCG CCTCGGGCAA TTCCGGCGCC TCCGGCCTGG GGAGCCTGTT CTCCGCCTCG
GCCAACACCT GGTCGCTGGG GCTCAGCGCC GCCCAGACCT TGTTCGACGC CGGAGCCCGC
AAGGCCCGCG TGGAGCAGGC CAGGGCGTCC TACGACGCCA CGGTCGCCGA CTACCGCCAG
ACGGCGCTGA GCGCCTTTGA GGACGCCGAG AACCAGTTGA CGGCCGTCGG GGCCTTGGAG
CGCCGCTATG CCCTGCTGAA GACCTCCTCG GACGCCGCCG ACCAGACCGA GCAGATGCTG
CTTAACCAGT ACAAGGCCGG GCAAGTGGCC TACACCGACG TCGTCCAGGC CCAAGCCTCG
GCCTTGTCGG CGCGCCGTTC GCTGCTGACC GCGGCGGTGG CCCGCCAGAC GACCGCCGTC
GCCCTGATCC AGGCGCTGGG CGGCGGCTGG AAGACCGGCG GCTAG
 
Protein sequence
MPALKHLAVA GSALALLSAC AAGPAYEKPE VSATSLPATY KALDGWKLAT PGDLTDRGDW 
WTLLGDPQLD ALIGRVNVSN QTIAAAEANY RQARAIVREQ RASLFPTVDL SGSATKSGGS
GSTGSGAGAS SSGRRYQVGI GASWEPDLWG RVRAGVSGAK ANAQASQADL AGARLSMQGE
LAVNYLGLRQ TDAEIALVAK TVEGYQRSLT ITQNRYAAAI APKSDVLQAT TQLAGAQADL
ESLRQTRATY ENAIATLVGE PASGFKLAAD PAWSAGVPEI PAGVPSTLLE RRPDIAAAER
RVAAANADIG VARAAFFPTF GLSASGNSGA SGLGSLFSAS ANTWSLGLSA AQTLFDAGAR
KARVEQARAS YDATVADYRQ TALSAFEDAE NQLTAVGALE RRYALLKTSS DAADQTEQML
LNQYKAGQVA YTDVVQAQAS ALSARRSLLT AAVARQTTAV ALIQALGGGW KTGG
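
The relationship between the gene sequence and the protein sequence can be spot-checked with a short Python sketch. It strips the spaces that separate the 10-character blocks, translates codons until the first stop, and computes a GC fraction. The helper names are hypothetical, not part of any database tool. The codon table is the standard one, on the assumption that it applies here because translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons. Only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGCCCGCCC TCAAGCACCT TGCCGTCGCC"  # start of the gene sequence
last_blocks = "AAGACCGGCG GCTAG"                  # end of the gene sequence

print(translate(first_blocks))    # MPALKHLAVA
print(translate(last_blocks))     # KTGG
print(gc_fraction(first_blocks))  # 0.7
```

Passing the full 1425 bp gene sequence to the same functions should give the complete protein and a whole-gene GC fraction to compare against the reported 73%.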