Gene Caul_2105 details


Gene Information

Locus tag: Caul_2105
Symbol:
ID: 5899560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2252424
End bp: 2254181
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 69%
IMG OID: 641562594
Product: MotA/TolQ/ExbB proton channel
Protein accession: YP_001683731
Protein GI: 167646068
COG category: [S] Function unknown
COG ID: [COG5306] Uncharacterized conserved protein
TIGRFAM ID:
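
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2252424, 2254181)
print(length)                     # 1758
print(protein_length_aa(length))  # 585
```

Under these assumptions both results agree with the listed Gene Length (1758 bp) and Protein Length (585 aa).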


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.794323
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCATGC TTGCGGTGAC GCTCGCGCTG ACGGTCACAG CCGTTCCCAC CGGAGCCTTG 
GCTTGGTGGA AGAAGGAGTG GTCCTACCGC ACCCAGATTC AGGTGGACAC CACCGCTTCC
GGCGTGCCGA TCTCTGGCTC GGTCGGGCGC GCGCCCCTGC TCCTGCGACT GCATACCGGC
AATTTCCGCT TCGAGGACGC CGCCGACAAC GGCGCCGACC TGCGCTTCGT GGCCGCCGAC
GACAAGACGC CGCTGGCCTA TCACATCGAA AGCTTTGATC CGCTGCTCGG CGTGGCCACC
GTCTGGGTCG ATGTGCCGAA GTTCCCGGTC GGCGCCAAGC AGACGATCTG GATGTACTAC
GGCAACAAGA CCGCCACGGC GGCGATCGAC ACGGCCGGCA CCTTCGATCC GGACTACGCA
GCCGTCTATC ACTTCGACAC CGCCGCCGGC GTGGCCCCCA AGGACAAGAC CGCCAACGGC
AACGACGCTC GCACCGGCCC GGCGGCCGTC GATGACGGCG CGGTGATCGG CCGCGGCGCG
CGCTTCACCG GCTTGGCCCC GATGGAAGTG GGCGCTTCGC CCTCGCTGAC CCTGGCCCCC
GGCGCTCCCT TCACGATCAG CGCCTGGGTC CGTCCCGACG CACTGGCCGG CGCGGTCGCT
TTGCTGGCTC GCCGTGAAGG AAACCAGGCC CTGGTCCTTG GACTTGACCA GGGCGCGCCG
TTCGTCGAGG TGTCCGGCCA GAGGATCGCC TCGACCGCGG CGCTGGTCCA GGGCCGCTGG
AGCCACCTGG CCGTCACCTC CGACGGACGC ACGGTCACCC TTTATGTCGA TGGCCATTCG
ACGGTGTCGA CCCCTGCGGC CACCCCGGCC CTGAGCGGCC CCATCGCGAT CGGCGGCGAC
GCCCCGGGGG GCGTTCTGGC CGGCTTCCAG GGCGGGCTCG ACGAGCTGCG GCTGTCGAAG
GCGGCCCGCT CGGCGGCCCT GATCCAGATG GACGCCGCCT CGCAGGGCGC GGAGTCGAAG
CTGACGGTGT TCGCCGGCGA CGAGAAGCAG TCGGGCTTCG GCTTTGGTCC TTTCGGCGTG
ATCGTCAAGT CCGTCACGGT CGACGCCTGG GTGATCATCG GCATCCTGGG CGTGATGGCG
GCCGCCTCGT TCTGGGTGAT CTGGAGCAAG TCGATGTACG TCGGCTCGGT CGACGCGGCC
AACGAACGCT TCGTCAAGCT GTTCCGCGAG CAAGGCGACG ACTTCCTGGC CCTGGCGCGC
GGCGGCGAGC CGCGCGGCGT GGCCCAGTCG TCGATCTATC GCATCTATCG CGCCGGCGGC
GATGAGATCA CGCGTCGCCA CCTGGCCAGC GGCGCCATGC TGAGGGCCGA GGCGATCCAG
GTGATCCGCG CCCTGATGGA CGCCACTCTG GTGCGCGAGA ATCAGAGGCT GTCCAAGTCG
CTGGTGGTCC TGACCATCGC CATTTCCGGC GGTCCGTTCC TGGGTCTTCT GGGTACGGTG
GTCGGGGTGA TGATCACCTT CGCGGCGATC GCCGCGGCCG GTGACGTCAA CGTCAACGCC
ATCGCCCCCG GGATCTCGGC GGCGCTGCTG GCCACGGTGG CGGGCTTGGG CGTCGCGATC
CCCTCGCTGT TCGGCTACAA CTATATCCTC ACCCGCAACA AGGCCGTCCA GGCGAACATG
ATCGTGTTCG TCGACGAGTT CATCACGCGG GCCTCCGAAC GCTACAGCGG CGAAGCCTTC
GCCGCGGCCG CCGAATAG
 
Protein sequence
MLMLAVTLAL TVTAVPTGAL AWWKKEWSYR TQIQVDTTAS GVPISGSVGR APLLLRLHTG 
NFRFEDAADN GADLRFVAAD DKTPLAYHIE SFDPLLGVAT VWVDVPKFPV GAKQTIWMYY
GNKTATAAID TAGTFDPDYA AVYHFDTAAG VAPKDKTANG NDARTGPAAV DDGAVIGRGA
RFTGLAPMEV GASPSLTLAP GAPFTISAWV RPDALAGAVA LLARREGNQA LVLGLDQGAP
FVEVSGQRIA STAALVQGRW SHLAVTSDGR TVTLYVDGHS TVSTPAATPA LSGPIAIGGD
APGGVLAGFQ GGLDELRLSK AARSAALIQM DAASQGAESK LTVFAGDEKQ SGFGFGPFGV
IVKSVTVDAW VIIGILGVMA AASFWVIWSK SMYVGSVDAA NERFVKLFRE QGDDFLALAR
GGEPRGVAQS SIYRIYRAGG DEITRRHLAS GAMLRAEAIQ VIRALMDATL VRENQRLSKS
LVVLTIAISG GPFLGLLGTV VGVMITFAAI AAAGDVNVNA IAPGISAALL ATVAGLGVAI
PSLFGYNYIL TRNKAVQANM IVFVDEFITR ASERYSGEAF AAAAE