Gene Caul_3414 details

Gene Information

Locus tag: Caul_3414
Symbol:
ID: 5900869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3685824
End bp: 3687491
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 63%
IMG OID: 641563920
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_001685039
Protein GI: 167647376
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
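
The coordinates and lengths in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the gene record.
start_bp = 3685824
end_bp = 3687491

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1668
print(protein_length)  # 555
```

Both results match the Gene Length and Protein Length fields of the record.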


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.106241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.225653
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAGC AATATATTTT CCAGATGCAG GGCCTGACTA AGGCCTTCCC CGGCGGCAAG 
AAGATCTTCG AGAACATCTG GCTGTCGTTC TATTCCGACG CCAAGATCGG CGTCGTCGGC
GTCAACGGTT CGGGCAAGTC GACCCTGCTG AAGATCATGG CCGGGCTAGA CGACCAGTTC
TCGGGCGAGG CCAAGGCCGC CGAGGGCATT CGTCGCGGTT ACCTGCCGCA AGAGCCGGTG
CTGGACCCCA CCAAGGATGT CTGGGGCAAT GTCATCGCCG ACTGCGAAGA CAAGCTGATC
TTCGACAAGT ACAACGAGAT CGCCAACAAG CTCGGCGAGG ACTATTCCGA CGCGCTGATG
GAGGAGATGA CCAAGCTCCA GGAGATCATC GACGCCCGCG ACCTGTGGGA CATCGACTCC
AAGGTCGAGA TGGCCATCGA CGCCCTGCGC TGCCCGCCCA ACGACGCCAA TATCGAAAGC
CTGTCGGGCG GTGAAAAGCG CCGCATCGCC CTGGCCCGCC TGCTGCTCAG CAAGCCCGAC
ATCATGCTGC TCGACGAACC CACCAACCAC CTGGACGCCG AGTCGGTGGC CTGGCTGCAG
CACCACCTGG AAAACTTCCC GGGCTGCGTG ATCCTGGTCA CCCACGATCG CTACTTCCTG
GATCAGGTCA CCAAGTGGAC CCTGGAACTG GATCGCGGCA AGGGCATCCC CTACGAGGGC
AACTATTCCG GCTGGCTGGA GCAGAAGACC AAGCGCGTCG TGCAGGAGCA GTCGGAGTCC
GACAGCCGTC AGCGCGCGCT GACCCGCGAA CTGGAATGGG TCCGCAGCTC GCCCAAGGCC
CGCCAGTCCA AGTCGAAAGC CCGCCTGGCG AGCTACGAGG AAATGGTCGC CGCGCAGGAG
AACGCTCGCG CCGCCCAGAC CCAGGCCCAC ATCCAGATCC CCCCCGGCCC GCGCCTGGGC
AACCTGGTGC TGGAAGTCGA GAACCTGGAA AAGGAATATG GCGACAAGGT GCTGTTCAAG
AACCTGTCGT TCCGCCTGCC GCCCAACGGC ATCGTCGGCG TCATCGGCCC CAACGGCGCC
GGCAAGTCGA CGCTGTTCAA GCTGATCACC GGCCGCGAGC AGCCCGACCA GGGCACGGTC
AAGGTCGGCG AGACCGTGAA GCTTTCGTAT GTGGACCAGT CGCGCGACGC CCTCGACCCG
AACAAGACCA TCTGGGAAGA GATCAGCGGC GGCACCGACG TGATGATCGT CGGCAAGCGC
GAGATCAACT CCAGAGCCTA TGTCGGCAGC TTCAACTTCA AGGGCGGCGA CCAGCAGAAG
AAGGTCGGCC TGCTGTCGGG CGGGGAGCGC AACCGCGTCC ACCTGGCCAA GACCCTGGCC
ACCGGCGGCA ACCTGCTGCT GCTCGACGAA CCGACCAACG ACCTGGACAT CGAAACCTTG
CAGGCCCTGG AAGAGGCCCT GGAAGAGTTC GCCGGCTGCG CCGTGGTCAT CTCCCACGAT
CGCTGGTTCC TAGACCGCCT GGCCACCCAC ATCCTGGCCT TCGAGGGCGA CAGCCACGTC
GAATGGTTCG AGGGCAACTT CGAGATGTAC GAGGAAGACA AGAAGCGTCG CCTGGGCGCC
GACAGCCTGA TCCCCAAGCG CATCAAGTTC CAGAAGTTCG CGCGGTAG
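
The GC content field of the record can be recomputed from a sequence in this layout. The following is a minimal sketch: `gc_percent` is a hypothetical helper name, and it is assumed that the spaces and line breaks in the listing are only formatting and should be ignored. It is demonstrated on the first ten-base block of the gene sequence rather than on the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the ten-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence.
first_block = "ATGGCTCAGC"
print(gc_percent(first_block))  # 60.0
```

Passing the whole gene sequence to the same function gives the value to compare with the GC content field of the record.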
 
Protein sequence
MAQQYIFQMQ GLTKAFPGGK KIFENIWLSF YSDAKIGVVG VNGSGKSTLL KIMAGLDDQF 
SGEAKAAEGI RRGYLPQEPV LDPTKDVWGN VIADCEDKLI FDKYNEIANK LGEDYSDALM
EEMTKLQEII DARDLWDIDS KVEMAIDALR CPPNDANIES LSGGEKRRIA LARLLLSKPD
IMLLDEPTNH LDAESVAWLQ HHLENFPGCV ILVTHDRYFL DQVTKWTLEL DRGKGIPYEG
NYSGWLEQKT KRVVQEQSES DSRQRALTRE LEWVRSSPKA RQSKSKARLA SYEEMVAAQE
NARAAQTQAH IQIPPGPRLG NLVLEVENLE KEYGDKVLFK NLSFRLPPNG IVGVIGPNGA
GKSTLFKLIT GREQPDQGTV KVGETVKLSY VDQSRDALDP NKTIWEEISG GTDVMIVGKR
EINSRAYVGS FNFKGGDQQK KVGLLSGGER NRVHLAKTLA TGGNLLLLDE PTNDLDIETL
QALEEALEEF AGCAVVISHD RWFLDRLATH ILAFEGDSHV EWFEGNFEMY EEDKKRRLGA
DSLIPKRIKF QKFAR
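
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is an illustration under stated assumptions, not the database's own procedure: `translate` is a hypothetical helper, and the codon table is the usual compact 64-letter encoding whose amino acid assignments are shared by the standard code and translation table 11. Table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their spacing.
print(translate("ATGGCTCAGC AATATATTTT CCAGATGCAG"))  # MAQQYIFQMQ
```

The output matches the first ten residues of the protein sequence. The last nine bases of the gene, GCGCGGTAG, translate to AR followed by a stop, which agrees with the protein ending in FAR.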