Gene Caul_3152 details

Gene Information

Locus tag: Caul_3152
Symbol:
ID: 5900607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3416082
End bp: 3417482
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 65%
IMG OID: 641563655
Product: hypothetical protein
Protein accession: YP_001684777
Protein GI: 167647114
COG category:
COG ID:
TIGRFAM ID:
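
The coordinates and lengths in this record agree with one another if Start bp and End bp are read as 1-based inclusive positions and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not statements made by the record. A minimal sketch of that check follows; the function name is hypothetical.

```python
def lengths_from_coords(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(lengths_from_coords(3416082, 3417482))
```

With the values from this record the function returns 1401 bp and 466 aa, matching the Gene Length and Protein Length fields.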


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.416399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG ACATGGAAGC AGCGGCGGCG GACTCGTTCG CCGACCTGTT CAAGCGACGG 
AAGAAGCACC CGTTCACGGG TGTGGGCCAG CCTTGCGCCA ACTGCGGCGC GGCGCTGGAA
GGTCCGTACT GCCATGAGTG CGGCCAGAAC GCCGACAACC ACAAGCGCTC GATCTTCCAT
CTGATCATCG AGGCCATCGA GGGGATGTTC CACCTCGACG GACGCCTGGC CCTGACCTTG
CCGGCCCTGT TCTTTCAGCC CGGCAAGCTG GCCAAGGATT ACATGGAGGG GCGTATCGTG
CGGCACGTTC CGCCGTTCCG CACCTTCCTG GTCGCCCTGC TGCTGTTCAT CTTCGCGGCC
GAGCACGCCA TACATTCATT CAAGCACCAC GCCGAGGAAG AGACCCACAA GCGCGCCGAG
GCCCTGGCCA CGCCGCAGGG CCGGGCCGCC GAGGCGGGCA GGATGCGCGT CGAGGCCGCC
AAGGACCGCG CTTCACGGCT GAAGGGAGCC GCCGAGGATC GCGACGCGGC CCTGGCCGAG
GCGGCCAAGG ATCGCGACGA GGCGTTGAAG GATCCCGATC AGGACAAGGC CAAGACCCTG
CAGGCCTACC AGGAAGCCGT CGCCAAGGCT CCGGTCGACT ATCAGGAAGA GGCTGACGAG
GCCCAAGCCC GCTATGCCAA GAGCATCGCC GACGCCGATC ACCTTCAGAA CAACCCGCTG
GCCGCCAAGG AAATCCTCGA GGCCGACGAG AAGATGCGCA AGAAGACGGC CGACGCGATC
CGTGGGGCTA AGGTCACGAG CGTGTTCGAC GGCGACGAGG TCGAGTCCCA GGCCACCAAG
TTCGCCGATG ACGCGACCGA CACCATGACG GTCGGCGGCG TCAACATAAA GACGCCCAAG
ACCGAGGTCG GTCCCGCTGA AGCCGACCAC GGCGGCATCG CGGTCCCGGT CACGGGCGGC
GCGCACGGCA AGGAGCACTG GCTCAAGGCC GGCCTGATCA AGGCGCTGGA GAATCCCGAA
TACTACATGC TGGTGATGTT CGGCTGGGGC CACCGCCTGG CGGTGCTGCT GCTGCCGATG
CTGGGACTCA GCCTGGCCCT GGTCTACGTC AACAAGCGGC AGTTCTTCAT CTACGACCAT
CTGATCGTGG CCACCAACCT GCTGTCGTTC GCGTTCCTGA CCAATGCGCT GGGCCTGGTG
CTGCCCGATC CGGTCCGCAA GTGGTGGTTC CTGCTGCTCA TGGTCTGGAC GCCGATCAAC
CTGTTCCAGA CCCTGCGCGG GGCCTACGGT TCCAGCATTC CCGGGGCCAT CATCAAGACG
CTGATCGTCT GGTGGTCGAC CATGTTCTCG TTCGTTTTGC TGCTCAGCGT TCTACTGGTT
TTCGCCTTGG CCCAGATTTA G
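
The GC content field can be recomputed from the gene sequence above. The sketch below assumes the sequence is pasted as text in the same layout used here (blocks of ten bases separated by spaces and line breaks); the function name is hypothetical, and how the record rounds its 65% figure is not stated.

```python
def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted DNA sequence."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Small illustrative input; paste the full gene sequence to check the record.
print(gc_fraction("ATGC GGCC"))
```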
 
Protein sequence
MTADMEAAAA DSFADLFKRR KKHPFTGVGQ PCANCGAALE GPYCHECGQN ADNHKRSIFH 
LIIEAIEGMF HLDGRLALTL PALFFQPGKL AKDYMEGRIV RHVPPFRTFL VALLLFIFAA
EHAIHSFKHH AEEETHKRAE ALATPQGRAA EAGRMRVEAA KDRASRLKGA AEDRDAALAE
AAKDRDEALK DPDQDKAKTL QAYQEAVAKA PVDYQEEADE AQARYAKSIA DADHLQNNPL
AAKEILEADE KMRKKTADAI RGAKVTSVFD GDEVESQATK FADDATDTMT VGGVNIKTPK
TEVGPAEADH GGIAVPVTGG AHGKEHWLKA GLIKALENPE YYMLVMFGWG HRLAVLLLPM
LGLSLALVYV NKRQFFIYDH LIVATNLLSF AFLTNALGLV LPDPVRKWWF LLLMVWTPIN
LFQTLRGAYG SSIPGAIIKT LIVWWSTMFS FVLLLSVLLV FALAQI
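
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG ordering. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are allowed, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGGCGG ACATGGAAGC AGCGGCGGCG"))  # MTADMEAAAA
```

The first ten codons translate to MTADMEAAAA, which matches the start of the protein sequence listed above.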