Gene Caul_2977 details

Gene Information

Locus tag: Caul_2977
Symbol:
ID: 5900432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3234496
End bp: 3235701
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 74%
IMG OID: 641563474
Product: RND family efflux transporter MFP subunit
Protein accession: YP_001684602
Protein GI: 167646939
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
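
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3234496
end_bp = 3235701
gene_length_bp = 1206
protein_length_aa = 401


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1206
print(expected_protein_length(gene_length_bp))  # 401
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.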


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.540318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.494242
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGA TCAACTCGGT CCGGCGCCGG CCCTGGCAAG ATGAAGGCCC ACCTTCGTTG 
GATCGCCACG CCATGATCGA CGCGCCCCTA TCGAGCCAGC GCCAGGCGCC CACGACGGGC
CGGGCGACCG CCTTGGCCGC AGCCGCGAGG CCGGAGACGC CCTCCAGGGT TCCCAGCACA
GCCCCAGCCC CTCGGGCCGC CGCCCGGCCC GTGTGGGTTT GGCTGGCCAT CGCGGCGATG
GCGGTCGCGG CGTTGGTCGG GGGAGCCGTG CTGTTGCGCC CGCCGACGAT CGTCCCCGCG
GCGGTCACGT CGGGCGAGGC CGTCGATGTG GTCTATGCGT CCGGCGTGGT GGAATACGTC
CGCCAGGCCC ATGTCGCGCC GGTGACCACC GCGCCGATCC GCCAGGTCGC CGTCGCCGAG
GGCGAGCGCG TCGTCGCGGG GCAATTGCTG GCCCAGCTCG AGGATGGTCC GTCACAGGGG
ACGGCGCTGC AATTGGCGGC CCAGGCGGTC CAGGCGCGTG TGACCGCCGA CCGCACGCGG
CGCCTGTTCG ACGCCGGCTT CGCCGCCAAG GCCGCCGACG ACGACGCCCA GGCCCAGGCC
CGGGCCGCCG AGGCCGCCGC CCAGAGCGCC CGGGCGCGCC TGCGCGACTA TCGCCTCACC
GCGCCCTTCG CCGGCCGGAT TCTGCGGCGG GACGCCGAGC CCGGCGATCT GGCCAGCGCG
GGCGCTACGC TCTTCCTGCT GGCCGACACC AAAGCGGTGC GCGTCACCGC CGACGTCGAT
GAGCGCGACA TCGCCAAGCT GAAGCCCGGC GCCCAGGCCC TCATTCGCGC CGACGCCTTC
GCCGGTCGGG TGTTCACCGG CCAGATCGCC CAGATCACCC CGGCCGGCGA CGCCACCGGC
CGTGTGTTTC GCGTCCGTAT CGCCTTGCCG CCGGGCGGGG CTCTGGCGCC GGGCATGACG
GTCGAGACCA ATCTGGTCGC CGAGCGCCGC GCGCATGCGA TATTGGCGCC GGCCAGCGCG
GTGAGCGACG GCGCGGTCTG GCGGATCGCG AAGGGTCGCG CCTATCGGAC ACCTGTGCGC
GTGGGAGCGG CCAGCGCGGC GCGGGTCGAG ATCGTCAGCG GCCTTTCGGC TGGCGACGTC
GTGGTCGCCG CGCCGTCCAA GACGCTTCGC GACCGGCAAC GCGTGCGCCT GGCCGCCTCA
CGATGA
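
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as printed.
first_row = "ATGATCACGA TCAACTCGGT CCGGCGCCGG CCCTGGCAAG ATGAAGGCCC ACCTTCGTTG"

row = clean_sequence(first_row)
print(len(row))                       # 60 bases per full row
print(round(100 * gc_fraction(row)))  # GC percentage of this row only
```

Applying the same two functions to all rows joined together gives a value that can be compared with the GC content field in the Gene Information table.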
 
Protein sequence
MITINSVRRR PWQDEGPPSL DRHAMIDAPL SSQRQAPTTG RATALAAAAR PETPSRVPST 
APAPRAAARP VWVWLAIAAM AVAALVGGAV LLRPPTIVPA AVTSGEAVDV VYASGVVEYV
RQAHVAPVTT APIRQVAVAE GERVVAGQLL AQLEDGPSQG TALQLAAQAV QARVTADRTR
RLFDAGFAAK AADDDAQAQA RAAEAAAQSA RARLRDYRLT APFAGRILRR DAEPGDLASA
GATLFLLADT KAVRVTADVD ERDIAKLKPG AQALIRADAF AGRVFTGQIA QITPAGDATG
RVFRVRIALP PGGALAPGMT VETNLVAERR AHAILAPASA VSDGAVWRIA KGRAYRTPVR
VGAASAARVE IVSGLSAGDV VVAAPSKTLR DRQRVRLAAS R
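
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares for all 64 codons; it does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG. `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as printed.
first_30 = "ATGATCACGA TCAACTCGGT CCGGCGCCGG"
print(translate(first_30))  # MITINSVRRR
```

The result matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` allows the same comparison over all 401 residues.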