Gene Caul_2554 details


Gene Information

Locus tag: Caul_2554
Symbol:
ID: 5900009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2775760
End bp: 2776866
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 47%
IMG OID: 641563045
Product: hypothetical protein
Protein accession: YP_001684179
Protein GI: 167646516
COG category:
COG ID:
TIGRFAM ID:
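
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the function name `feature_lengths` is hypothetical, not part of any database API):

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(feature_lengths(2775760, 2776866))  # (1107, 368)
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.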


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.118621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAGG CCGAAATCGA TGAGGCGAAG CGCCTCGTCA CTACCGATAC AGTTCAAATT 
ACGATCGGCG AGATCGCGTC GATGTATGCC TCTGACGAGC TTGATATTAT ACCGGAATTT
CAAAGATTGT TCCGTTGGTC TATAGAGAAA AAATCTAGTT TTATAGAATC CATCCTGATT
GGAATTCCTG TCCCCCCCGC GTTTGCCTAC GAAAATGCGG ATGGCACATG GGAGCTTATT
GACGGCCTTC AAAGAATTTC TACCATTCTT GAATTCATGG GACTCCTGCG CGATCTTGAT
AATCCCGGAT CATTTAGACA GTCGACCCTT ATGGAAACAA AGTATCTAAA ATCTCTCCGA
GACGCGAGAT GGTCTCCATT TGATCTTGAC GCTACAAATG TACTTGATAA ATCTCTTCAG
CTATTTTTCC GACGAGCCAG AATAGACTTT CAAGTTCTAA AGCATCCGAG TGATCCGCGC
ACGAAATTTG ATCTTTTTCA GCGACTTAAC CGCGGCGGCG CTTACGCCAA TGAGCAGGAA
GTTAGAAGCT GCTCGATGGT TCTCGCGGAT CGCGAATTTA CAAAGGAAAT CAAAAATTTT
GCAGATAGCG ATATATTTCG GAAAGTGTTT AAAATTACCC CAGAACAATC TATAAATCAG
AAAAACGTTG AATACGCTGT TAGGTTGATC GTTCATACAT TCAGAGACTT TACCAGCGGC
ACAGATGTAC AAGAGTTCCT AGACAAATCT ATAATTAGCA TCATGACGGA AGAAAATCAG
GCCGCCGTGA TGGAGACAAT TCGCTGGACG GTGGAGACCC TGAGTCGAGC GGCTGGGGAT
GGAGTATTGG TGCCGCCAGC CGACGCTCCT GAAGAAATTG CCAATCGTTT TTCTCTCCGC
GCGCTAGAGG CGATAGCATC TGGCCTGGCA AGAAACCGGG AGGCTGTATC AAGGCTTCCC
GATCAAGATG CATTCGTTCG GGAGAGAATC TCTGGATTTT GGCAGCAGGA ATCAGTGCTC
CAAATGAGCG CCTCCGGCTT GCGTGGAACC ACTCGTATTC AGCGCACTGT TCTTTTTGGA
GAATCTTGGT TTAAACCCGA TGCCTGA
 
Protein sequence
MLKAEIDEAK RLVTTDTVQI TIGEIASMYA SDELDIIPEF QRLFRWSIEK KSSFIESILI 
GIPVPPAFAY ENADGTWELI DGLQRISTIL EFMGLLRDLD NPGSFRQSTL METKYLKSLR
DARWSPFDLD ATNVLDKSLQ LFFRRARIDF QVLKHPSDPR TKFDLFQRLN RGGAYANEQE
VRSCSMVLAD REFTKEIKNF ADSDIFRKVF KITPEQSINQ KNVEYAVRLI VHTFRDFTSG
TDVQEFLDKS IISIMTEENQ AAVMETIRWT VETLSRAAGD GVLVPPADAP EEIANRFSLR
ALEAIASGLA RNREAVSRLP DQDAFVRERI SGFWQQESVL QMSASGLRGT TRIQRTVLFG
ESWFKPDA
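
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it assumes the amino acid assignments of translation table 11 (which match the standard code for sense codons), ignores the table's alternative start codons because this gene begins with ATG, and uses a hypothetical helper named `translate` rather than any particular library.

```python
from itertools import product

# Codon table built in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the record's block spacing kept.
print(translate("ATGTTGAAGG CCGAAATCGA TGAGGCGAAG"))  # MLKAEIDEAK
```

Because whitespace is stripped before translation, the gene sequence can be pasted in with its 10-base blocks and line breaks intact.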