Gene Caul_3923 details

Gene Information

Locus tag: Caul_3923
Symbol:
ID: 5901385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4243251
End bp: 4245083
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 69%
IMG OID: 641564444
Product: ABC transporter related
Protein accession: YP_001685546
Protein GI: 167647883
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
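
The start, end, and length values in this record agree with one another, which a little arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4243251, 4245083)
print(length_bp)                  # 1833
print(protein_length(length_bp))  # 610
```

Both results match the Gene Length and Protein Length fields of the record.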


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.123801
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCC CGACTCGCGC CCCTGTCCTG GCCCTCAAGG ACGTCCGTCT CGCCGATGGC 
GCCAAGCCGC TGTTCGACGG CGTCGACCTG GCCCTGGAGC CGCGCATCCG CGCCTGCCTG
GTCGGCCGCA ACGGGGCGGG CAAGAGCACC CTGCTGAAGA TCCTCGCCAA CCAGGGCGTC
GAGCCCGACA GCGGCGAACG CTCGGCCCAG CCGGGCGCCA AGATCGTCTA TGTCAGCCAG
GAGCCGGACA TCACCGGCGA GACCCTGCTG GACTACTGCA CGGCCGGCGG CGCCGAGGAC
TACGAGGCCC ACGCCACCCT GGCCGATTTC GGCCTGGACT TCCAGAAGAG CACGCAAGGC
CTGTCGGGCG GCGAGCGCCG CCGCGCGGCC CTGGCCCGGG CGTTCGCCGA GCAGCCCGAT
GTGCTGCTGC TGGACGAGCC GACCAACCAC CTGGACATCT TCGCCATCCA GACCCTGGAA
GAGGAACTGG CCAATTCCAA GTGCGCGGCC CTGATCGTCA GCCACGATCG CGCCTTCCTC
AACCGCGTCA CCCAGCGCAC CTTCTGGCTG GAGCACCGCA AGGTGCGCCG CCTGGACAAG
GGCTTCGCCG AGTTCGAGAT CTGGAGCGAA CAGGTGTTGG CCGCCGAGGC CGACGAGGAG
CGTCGCCTCA ACAAGCAGCT GGAGAAGGAA AACGCCTGGC TGGCGCGCGG CGTCCAGGGC
CGGCGGGCTC GCAACGAAGG CCGCCGCCGC GCCCTGATGG AACTGCGCGG CCAGAAGAAG
GACCTGCAGT CCGACAAGCG CGGCAGCATG ACCATGGCCG TGGAAAGCTC GGGCACGTCG
GGCAAGCGCG TGGTCGAGGC CAAGCACGTC ACCAAGCGCT TCGGCGACCG GACGATCATC
GAGGACTTCT CGACCCGGAT CCTGCGCGGC GACCGCGTGG CCCTGGTCGG CCCCAACGGC
GCGGGCAAGA CCACCCTGGT GCGGATGCTG CTGGGCGAGA TCCCGGTCGA CGAGGGCACG
GTGCAACTCG GGACCAATCT CGAGATCTCC TATGTCGACC AGAACCGCAT GGCGCTGTCG
GAGAAGATGA CGCTGTGGGA CTTCCTGACC CCCGGCGGCG GGGACTCGAT CCTGGTGCGC
GGCATCCCCA AGCACGTGGC CGGCTACGCC AAGGAATTCC TGTTCACCGA GGCCCAGCTG
CGCCAACCGG TCACCAGCCT GTCGGGCGGC GAGCGCAACC GCCTGCTGCT GGCCCGCGCC
CTGGCCACGC CGACCAACCT GATGGTGCTC GACGAACCGA CCAACGACCT GGACATGGAC
ACCCTGGACC TGCTGGAAGA GGTTCTGGCC GATTTCGAGG GAACCCTCAT TCTCGTCAGC
CACGATCGTG ACTTCATCGA CCGCCTGGCG ACCTCGACCA TCGCCATGAA CGGCCACGGC
AAGGTGCTGG AGACCCCCGG CGGCTGGACC GACCTGATCG ACCAGGCGCC GGACTTCTTC
AAGGGCGCCC GCGGCGCGGC CGGCGACTTC GCCACCGGCA CGGGCTCGAA CAAGGCGGTC
ATCACGCCGA CGGCCCCGCC GGTTCCGAAG AAGACCGTCA AGCTGTCGTA CAAGGATCAA
CGCCGCCTGG AGGAATGCGA GGCCCTGGTC GCCGCGTCGC CGAAGATCAT CGCCGACCTG
GAAGCCAGGC TGGCCGACGC CAGCTTCTAC GCCAAGGACC CGGCGGGCTT CGGCAAGGTG
ATGAAGGAAC TCGACAAGGC CCGGGCCGAC CTGGCGACGG CCGAGGAGGA CTGGCTGGCG
CTGGAGGAAA AGCGCGAGAC GATGGCGGGG TAA
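
A GC content figure like the one in the gene information can be recomputed from a sequence printed in this 10-base grouped layout. The following is a minimal sketch, assuming GC content means the share of G and C among the A/C/G/T bases. The helper name `gc_percent` is hypothetical, and the example runs on the first line of the sequence only, not on the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Whitespace (the 10-base grouping used in this record) and any
    other non-ACGT characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First line of the gene sequence, spaces included.
first_line = "ATGGCTTCCC CGACTCGCGC CCCTGTCCTG GCCCTCAAGG ACGTCCGTCT CGCCGATGGC"
print(round(gc_percent(first_line), 1))  # 70.0
```

Passing the full sequence, with line breaks left in, works the same way because whitespace is skipped.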
 
Protein sequence
MASPTRAPVL ALKDVRLADG AKPLFDGVDL ALEPRIRACL VGRNGAGKST LLKILANQGV 
EPDSGERSAQ PGAKIVYVSQ EPDITGETLL DYCTAGGAED YEAHATLADF GLDFQKSTQG
LSGGERRRAA LARAFAEQPD VLLLDEPTNH LDIFAIQTLE EELANSKCAA LIVSHDRAFL
NRVTQRTFWL EHRKVRRLDK GFAEFEIWSE QVLAAEADEE RRLNKQLEKE NAWLARGVQG
RRARNEGRRR ALMELRGQKK DLQSDKRGSM TMAVESSGTS GKRVVEAKHV TKRFGDRTII
EDFSTRILRG DRVALVGPNG AGKTTLVRML LGEIPVDEGT VQLGTNLEIS YVDQNRMALS
EKMTLWDFLT PGGGDSILVR GIPKHVAGYA KEFLFTEAQL RQPVTSLSGG ERNRLLLARA
LATPTNLMVL DEPTNDLDMD TLDLLEEVLA DFEGTLILVS HDRDFIDRLA TSTIAMNGHG
KVLETPGGWT DLIDQAPDFF KGARGAAGDF ATGTGSNKAV ITPTAPPVPK KTVKLSYKDQ
RRLEECEALV AASPKIIADL EARLADASFY AKDPAGFGKV MKELDKARAD LATAEEDWLA
LEEKRETMAG
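
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative. The sketch does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is removed first. A codon containing a character other
    than A, C, G, or T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCTTCCC CGACTCGCGC CCCTGTCCTG"))      # MASPTRAPVL
# Last line of the gene sequence, which ends in the stop codon TAA.
print(translate("CTGGAGGAAA AGCGCGAGAC GATGGCGGGG TAA"))  # LEEKRETMAG
```

The two outputs match the first and last ten residues of the protein sequence listed in this record.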