Gene Caul_2473 details

Gene Information

Locus tag: Caul_2473
Symbol:
ID: 5899928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2690030
End bp: 2693230
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 68%
IMG OID: 641562964
Product: acriflavin resistance protein
Protein accession: YP_001684098
Protein GI: 167646435
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID:
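
The length fields in this record follow from its coordinates: the start and end positions are inclusive, so the gene length is End - Start + 1, and for an unspliced CDS the protein length is the codon count minus one stop codon. A minimal Python sketch of that check, using the values from this record; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2690030, 2693230)
print(length)                  # 3201
print(protein_length(length))  # 1066
```

Both results agree with the Gene Length and Protein Length fields of this record.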


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTGG GCCTTTCGGG ACGCCTCACC AAGGCGTCGA TCAAGTCGCC CCTGACGCCG 
CTGATCCTGC TGGCGGCGAT CGCCGTGGGC CTGCTGGCGC TGATCTCGAT CCCCCGCGAG
GAGGAGCCGC AGATCAGCGT GCCCATGGTC GATATCATTG TCGCCGCGCC GGGCCTGCGC
GCGCCAGACG CCATAGAGCT GGTCGGCAAG CCTCTGGAGA CGATCGTCAA GAGCGTCGCC
GACGTCGAGC ACGTCTATAC CTTCGCCGAC GACAACCAGG TGATGGTCAC CGCGCGCTTC
AAGGTCGGGA CCGATCCGGA TTCGGCGGCC GTGCGCATCC ACGAAAAGGT CCGCGCCAAT
TACGATCGCA TTCCGGCGGG CATTCCCGAG CCGCTGATCC AGACGCGAGG CATCAACGAC
GTTCCCAGCC TGGTGCTGAC CCTGTCTCCC AAGCCCGGAG CCGCGGGCCA CTGGACCGAT
CAGGCTCTGT ATGAACTGGC GGGAAAGCTG AGAACCGAGG TCGCCAAGGT CGACAATGTC
GGCCTGACCT TCATTGTCGG CGGACGGCCC GAGGAGATCC GCATCGCCCC CGATCCGGCG
CGCCTGGCCC AGCACGGCGT GTCGCTGGCC GCTCTGATGG ACACGGTGCG CCAAGCCACC
CGCGCCTTTC CCGCCGGCCA AATTCGCGGC GGCGGCCAGG CGATCGACGT CACGGCAGGC
CGCAGCCTGA CCAGCGCCGT CGACATCGGC CTGCTGGCCC TGCCCTCGGC CAGCGGCCAG
GCGATCTATG TCCGCGACGT CGCCGACGTC ATCCAGGGGC CACGCGAGGA TCAGGCTCGC
GCCTGGCGCT ATGCCCGCGT CGACGGCGGC TGGAGCCAGG CGCCCGCGGT CAGCCTGGCC
ATCGCCAAGC GCAAGGGCGC CAATGCGGTC GTCGTCTCGC AGGCGGTGCT CGCTCGGGTC
GAGGCCCTGA AGGGCTCGCT GCTGCCTGAC AGCCTGGATG TGGCGGTGAC CCGCGACTAC
GGCGCCACGG CCAATGAGAA GGCCAACGAG CTTCTGTTCC ACCTAGGCCT GGCCACCCTG
TCGATCGTGG TGCTGATCGG CCTGGCCATC GGCTGGCGCG AGGCGGCGGT GACCGCCGTG
GTCATTCCCA CCACCATCCT GCTGACCCTC TTCGCCTCCA ACTTGATGGG CTACACCATC
AATCGGGTCA GCCTGTTCGC CCTGATCTTT TCGATCGGCA TCCTGGTCGA TGACGCCATC
GTCATGATCG AGAACATCGC CCGCCACTGG GCGATGGCCG ACGGCCGAAG CCGGATCGAC
GCGGCGGTCG ACGCCGTGGC CGAGGTCGGC AATCCGACCG TCGTCGCCAC CCTGACCGTT
GTCTCGGCCC TGCTGCCGAT GCTGTTCGTC TCGGGCCTGA TGGGTCCCTA CATGGCGCCG
ATCCCCGTCA ACGCCTCGGC GGCCATGGTG TTCTCGTTCT TCGTCGCCGT GGTCATCGCG
CCCTGGCTGA TGGTGCGGTT CGCCCGCAAG ACCCTGGCGG CCGGCGGCCA TGGGCATGAC
GGCGAAGGCA AGCTCGGCGC GCTGTATCGC CGGGTCGCCA GCCGGGTGAT CGCCACGCGT
AGGAGCGCCT GGACCTTCCT GATCGGCGTG GGCCTGGCCA CCTTGCTGGC CTGCGCGATG
TTCGCCACCA AGACCGTGAC GGTGAAGCTG CTGCCGTTCG ACAACAAGTC CGAACTTCAG
GTGGTGCTGG ACATGCCGGA GGGGACCTCG CTGGAGGCCA CCGCCCGGGC GCTGTCCGAC
GCGGCGGTGA TCACCCGCGC CCTGCCCGAG GTCACGGCGA TCGACGCCTA TGCGGGCACG
GCCTCACCAT TTAATTTCAA TGGCCTAGTG CGCCACTACT ACCTGCGCAA TCGCCCCGAC
CAGGGCGATC TGTCGGTGGC CCTGGCTGAA AAGAGCGAAC GCAAGCGCTC CAGCCACGTC
GTGGCTCTGG ATCTGCGCAA GCGGCTGGCC AAGGTCGCCC TGCCGGCCGG CGCGTCGATA
AAGGTGGTCG AGGCCCCGCC AGGACCGCCC GTGATGGCCA CCCTGTTGGC CGAGATCTAC
GGCCCCGACG CCAAGACGCG GCGCGCCGTG GCCGAGCGGG TCAAGGCCAC GTTCAAGTCC
GTGCCCTATA TCGTCGACAT CGACGACAGC TACGGTCAAC CCCAGCCCGG CCTGCGCCTG
GTCCCTGATC GTGACCGCCT GGAAGCCCTG AAGGTCAACG ATCGCGAGGT CCTGGACTCG
ATCGGCGCGG CCCTCGGGGG GCAGGTCGTC GGCTACGCCC ATCGTGGCGA GGGACGCGAT
CCGCTGGAAA TCTCGGTTCG CCTGCCGCAG TCGGCGCGCA GCTGGGGCCA GGGCCTGGCG
GCCCTGCCCG TGGCCCAGAG CCAGGGCGGC CGGCTGGTGG AGCTGGGCGA GGTGGTCACC
GCCACCCAGG AGGCCGGCTC GACCCATATC TTCCGCCGGG ACGGCCGCGA CGTCGACATG
ATCATGGCCG AACTGGCCGG CGCCTATGAG GCGCCGATCT ACGGGATGAT GGCGGTCGAC
AAGGCGATCA AGGCCGCCGA TTGGGGCAAT GTGCCAAAGC CCGACATCCG CATGAACGGT
CAGCCCACCG ACGAGAGCAA GCCGACCGTG CTCTGGGACG GCGAGTGGGA GATCACCTGG
GTGACCTTCC GCGACATGGG CGCGGCCTTC GGGGTGGCGA TCCTCGGCAT CTATGTGCTG
GTCGTCGCCC AGTTCAAGAG CTTCCGCCTG CCGCTGGTGA TCCTGACGCC GATCCCCCTG
ACCCTGGTCG GCATCGTCAT CGGCCACATC CTGTTCAAGG CGCCGTTCAC CGCCACCTCG
ATGATCGGCT TCATCGCCCT GGCGGGGATC ATCGTGCGCA ACTCGATCCT GCTGGTGGAC
TTCATCCGCC ACAGCCAGAC CGGCGCGCGG CCGCTGCGCG AGGTCTTGCT CGAGGCCGGC
GCGATCCGGT TCAAGCCGAT CGTCCTGACC GCCCTGGCGG CCATGATCGG CGCGGCGGTG
ATCCTGTTCG ACCCGATCTT CCAGGGGCTG GCGATCTCGC TGCTGTTTGG CCTGGCTTCG
TCGACGGTGC TGACCGTGCT GGTTATTCCC GCCATCTATG TCGTCCTGCG CGATGACACC
GCCTCGCCTC ACAAGGCCTG A
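
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a rough Python sketch, assuming only the standard library, that normalizes such text and computes a GC fraction. The helper names are hypothetical, and the sample is just the first line of the sequence, so its GC fraction will not match the 68% reported for the full gene:

```python
def normalize(seq_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGAATCTGG GCCTTTCGGG ACGCCTCACC AAGGCGTCGA TCAAGTCGCC CCTGACGCCG"
seq = normalize(sample)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `normalize` in the same way should yield a string whose length can be compared with the Gene Length field.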
 
Protein sequence
MNLGLSGRLT KASIKSPLTP LILLAAIAVG LLALISIPRE EEPQISVPMV DIIVAAPGLR 
APDAIELVGK PLETIVKSVA DVEHVYTFAD DNQVMVTARF KVGTDPDSAA VRIHEKVRAN
YDRIPAGIPE PLIQTRGIND VPSLVLTLSP KPGAAGHWTD QALYELAGKL RTEVAKVDNV
GLTFIVGGRP EEIRIAPDPA RLAQHGVSLA ALMDTVRQAT RAFPAGQIRG GGQAIDVTAG
RSLTSAVDIG LLALPSASGQ AIYVRDVADV IQGPREDQAR AWRYARVDGG WSQAPAVSLA
IAKRKGANAV VVSQAVLARV EALKGSLLPD SLDVAVTRDY GATANEKANE LLFHLGLATL
SIVVLIGLAI GWREAAVTAV VIPTTILLTL FASNLMGYTI NRVSLFALIF SIGILVDDAI
VMIENIARHW AMADGRSRID AAVDAVAEVG NPTVVATLTV VSALLPMLFV SGLMGPYMAP
IPVNASAAMV FSFFVAVVIA PWLMVRFARK TLAAGGHGHD GEGKLGALYR RVASRVIATR
RSAWTFLIGV GLATLLACAM FATKTVTVKL LPFDNKSELQ VVLDMPEGTS LEATARALSD
AAVITRALPE VTAIDAYAGT ASPFNFNGLV RHYYLRNRPD QGDLSVALAE KSERKRSSHV
VALDLRKRLA KVALPAGASI KVVEAPPGPP VMATLLAEIY GPDAKTRRAV AERVKATFKS
VPYIVDIDDS YGQPQPGLRL VPDRDRLEAL KVNDREVLDS IGAALGGQVV GYAHRGEGRD
PLEISVRLPQ SARSWGQGLA ALPVAQSQGG RLVELGEVVT ATQEAGSTHI FRRDGRDVDM
IMAELAGAYE APIYGMMAVD KAIKAADWGN VPKPDIRMNG QPTDESKPTV LWDGEWEITW
VTFRDMGAAF GVAILGIYVL VVAQFKSFRL PLVILTPIPL TLVGIVIGHI LFKAPFTATS
MIGFIALAGI IVRNSILLVD FIRHSQTGAR PLREVLLEAG AIRFKPIVLT ALAAMIGAAV
ILFDPIFQGL AISLLFGLAS STVLTVLVIP AIYVVLRDDT ASPHKA