Gene Caul_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3931 
Symbol 
ID5901393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4252718 
End bp4255798 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content70% 
IMG OID641564452 
Productacriflavin resistance protein 
Protein accessionYP_001685554 
Protein GI167647891 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.415531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.145125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCA ACCTCGCCGG CTTCGCGGTT CACCGCTGGC AGTTCACCCT GGTCGCCTTT 
GGCCTGCTGG TCATGCTGGG GGTGAACGCC TTCCTGAACG TGCCGCGTTC GGAGGACCCG
CACTTTCCGG TGCCGATCGT CATCATCCGC GCCGTGCTGC CGGGGGCCGA GCCCTCGGAG
ATGGAGCAGT TGGTCGCCGA CCCGATCGAG GACGCCATCG ACGGCCTGGA CGACATCGAC
AAGGTCGAGT CGACCAGCGT GGACGGGGCG GCGATCATCC GCGTGCACTT CGCCTGGAAC
GTCGATCCCG AGCGCAAGTA CGACCAGGTG GTGCGCGAGG TGAACGCCAT TCGCGGCGCC
CTTCCATCCG GCTTGGCGCG GCTGGAGATC GAGCGGGTGC GCACCACCGA GGTGGCCATC
GTCCAGGTCG CCCTGACCAG CGACGTCCTG CCGATGCGCC GGCTGGAGAA GGTGGCCGAC
CGGCTGCGCG AGCGGCTCGA CCGCGTGCCC GGCGTGCGCC AGGCCCGCTA CTATGGCGCG
CCCAGCAGCG AGGTGCGGGT GTCGCTGGAC CTGGCCCGGC TGTCGGCCCT GAAACTGCCG
GCCACGGCGG TGTCCGACGC CCTGAAGGGG GCCGGAGCCG AGGCGCCGAT CGGGGCCGTG
CAGGCCGGCG ACCGCCGCTT CAACGTCAAG GCCGGCGGGG CCTTCCGCTC GCTGGACGCG
GTCAAGGACA CCCCGGTGCG CTCGGTCGGC GGCCAGGTCG TGCGGGTGCG CGACGTCGCC
CAGGTGGCCT GGGCCCAGGA CGAGCCGACC CACCTGACCC GCTTCAACGG CAAGCGCGCG
GTGTTCGTCA CCGTCACCCA GAAGGACGGC CAGGACGTCG CCCGGATCAC CACCGCCGTC
GACCGGGTGA TGGACGACTA CGAGAAGACC CTGCCGGCCG GGGTCAAGCT GGAGCGCGGT
TTCGTCCAGG CCAGGAATGT CGAGCACCGG CTGGGCAACC TGTTTCGCGA CTTCGCCATC
GCCTTGGCCC TGGTGCTGAT CACCCTGCTG CCGCTGGGAC CGCGCGCCGG CCTGGTGGTG
ATGGTGTCGA TCCCGCTGTC CCTGCTGATC GGCCTGAGCC TGCTGCAGGC CTTCGGCTTC
ACCCTCAACC AGCTGTCGAT CGCCGGCTTC GTGCTGGCCC TGGGCCTGCT GGTCGACGAC
AGCATCGTCA TCACCGAGAA CATCGCCCGC CGCATCCGCG AGGGCGAGGC GCGCACCGAG
GCGGCCGTCA ACGGCGCCAA CCAGATCGCC CTGGCGGTGC TGGGCTGCAC CGCCTGCCTG
ATGCTGGCCT TCCTGCCGCT GATGGCTCTG CCCGGCGGCT CCGGCGCCTA TATCAAGTCG
CTGCCGGTGA CGGTGCTGTG CACGGTCGGC GCTTCGCTGC TGGTGTCGAT GACCATCATC
CCGTTCCTGG CCAGCCGGGT GCTGGACAAG TCCTCCGACC CCGAGGGCAA CCCCCTGCTG
CGCGGCGTCA ATGGCGCGAT CCGCCGCCTC TATCGGCCGG TGCTGCACTT TGGCCTGGCC
CGGCCGTGGC TGTCGCTGGC GATCATGCTG GCGATCTGCG CCACCACCGT GCCGATGCTG
AAGATCGTCG GCTCCAGCTT GTTCCCGGCC GCCGAGACCC CGCAGTTCCT GATCCGCGTC
GAGAGCCCGG ACGGCAGCCC CCTGGCCCGC ACCGACCGCG CCCTGCGCTT CGTCGAGGCG
CGCCTGAAGC AGGAGCCCGA CGTGGTCTGG CAGGCCGCCA ATGTCGGGCG CGGCAATCCG
CAGATCTTCT ACAACATCAG CCAGCGCGAG AGCGCCACCA CCTATGGCGA GGTGTTCGTC
AGCCTCAAGG CCTGGCGTCC GGGCAAGAGC GAGAGGGTGC TGGACGGCCT GCGCCGCGAC
TTCGCCCGCT TCCCCGGGGC GCGGATCAGC GTCGTCACCT TCGAGAACGG CCCGCCGATC
GACGCGCCGG TGGCCGTGCG GATCACCGGC CAGAACCTGG ACGCGCTGAA GGCCCTGGCG
GCCCGGACCG AAGCGATCCT CAAGGCCACG CCGGGCACGC GTGACGTCAA CAACCCCGTG
CGGCTGGACC GCACCGACCT CGACCTCGGC GTCGACGAGG GCAAGGCCGC GGCCCTGGGC
GTGCCGGCCG GCGCCCCGCG CCGCGCCGCG CGGCTGGCCC TGTCGGGCGA GGAGACCGGC
CGCTTCCGCG ATCCGGACGG CGACGACTAC GCGGTCAAGG TGCGGCTGCC GATGGGGACC
ACCGACGGCG CCCATCCTCT TGAAGGGGGG CGCAACACCC TGGCGGATCT CTCGAAGATC
TACGTCCCCA CCGCCGACGG CGAGGCCGCG CCCCTGGGCT CGATCGCCAG CCCGACCCTG
CGCTCCAGCC CCGCCCGCAT CGACCGCTTC GACCGCGAGC GGACGGTGAC GGTGACGTCC
TATGTCCAGA CAGGCTACCT GACCGCCAAG GTCACGGCCG ACGCCCTGGA CCGCCTGAAC
CGGCAACTGC CCATGCCTCC TGGCTACCGC CTGTCGCTGG GCGGCCAGGC CGAGGCGCAG
TCGGAAAGCT TCGCCGGATT GGGCGCGGCG GTGATGGTGG CGGTGTTCGG GATCCTGGCG
GTGCTGGTTC TGGAGTTTCG CAAGTTCAAG ACGGCCCTGG TGGTGGCCGG CATCATCCCG
TTCGGCCTGT TCGGGGCCGT GGCGGCGCTG TGGATCACCG GCTATTCGCT GAGCTTCACC
GCCACGATCG GGCTGATCGC GCTCATCGGG ATCGAGATCA AGAACTCGAT CCTGCTGGTC
GATTTCACCG AGCAGCTGCG ACGCGACGGC ATGGGCTTGC ACGACGCCAT CGAGAAGGCC
GGCGAGGTGC GGTTCCTGCC GGTGCTGCTG ACCTCGGTCA CGGCGATCGG CGGGCTGCTG
CCGCTGGCGC TGGAAGGTTC GGGGCTCTAT TCGCCGCTGG CCATCGTGAT CATCGGCGGG
CTGATCACCA GCACGGTGCT GTCGCGGGTG GCGACGCCGG TGATGTACTG GCTGACGGCG
CGGGGGGAGG AGCGGACCTA A
 
Protein sequence
MKFNLAGFAV HRWQFTLVAF GLLVMLGVNA FLNVPRSEDP HFPVPIVIIR AVLPGAEPSE 
MEQLVADPIE DAIDGLDDID KVESTSVDGA AIIRVHFAWN VDPERKYDQV VREVNAIRGA
LPSGLARLEI ERVRTTEVAI VQVALTSDVL PMRRLEKVAD RLRERLDRVP GVRQARYYGA
PSSEVRVSLD LARLSALKLP ATAVSDALKG AGAEAPIGAV QAGDRRFNVK AGGAFRSLDA
VKDTPVRSVG GQVVRVRDVA QVAWAQDEPT HLTRFNGKRA VFVTVTQKDG QDVARITTAV
DRVMDDYEKT LPAGVKLERG FVQARNVEHR LGNLFRDFAI ALALVLITLL PLGPRAGLVV
MVSIPLSLLI GLSLLQAFGF TLNQLSIAGF VLALGLLVDD SIVITENIAR RIREGEARTE
AAVNGANQIA LAVLGCTACL MLAFLPLMAL PGGSGAYIKS LPVTVLCTVG ASLLVSMTII
PFLASRVLDK SSDPEGNPLL RGVNGAIRRL YRPVLHFGLA RPWLSLAIML AICATTVPML
KIVGSSLFPA AETPQFLIRV ESPDGSPLAR TDRALRFVEA RLKQEPDVVW QAANVGRGNP
QIFYNISQRE SATTYGEVFV SLKAWRPGKS ERVLDGLRRD FARFPGARIS VVTFENGPPI
DAPVAVRITG QNLDALKALA ARTEAILKAT PGTRDVNNPV RLDRTDLDLG VDEGKAAALG
VPAGAPRRAA RLALSGEETG RFRDPDGDDY AVKVRLPMGT TDGAHPLEGG RNTLADLSKI
YVPTADGEAA PLGSIASPTL RSSPARIDRF DRERTVTVTS YVQTGYLTAK VTADALDRLN
RQLPMPPGYR LSLGGQAEAQ SESFAGLGAA VMVAVFGILA VLVLEFRKFK TALVVAGIIP
FGLFGAVAAL WITGYSLSFT ATIGLIALIG IEIKNSILLV DFTEQLRRDG MGLHDAIEKA
GEVRFLPVLL TSVTAIGGLL PLALEGSGLY SPLAIVIIGG LITSTVLSRV ATPVMYWLTA
RGEERT