Gene Caul_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3571 
Symbol 
ID5901026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3854728 
End bp3855861 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content71% 
IMG OID641564079 
Producthypothetical protein 
Protein accessionYP_001685196 
Protein GI167647533 
COG category[S] Function unknown 
COG ID[COG5330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.751929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA CCCGCGCCGC GTTGACCGAA CACGACATCC GCATGTTGGT GAAGGGCGCG 
ACGCCCGACG AGCGCGCCCT GGCCGCCCAC AAGCTGTGCC GCACGATCGA CCGCGCCGAG
CTGGACGCCG CTCAGCGAGC CCTGGCCGCC GACATCCTGC GGATGATGGC CGCCGACGCC
GCCGAACTGG TCCGTCGGGC CATGGCCCTG ACCCTGCGCA ATTCGCCGGT CCTGCCGGTC
GACGTCGCCA ATCGCCTGGC CCGCGACGTC GAGAGTATCT CCCTGCCGAT CATCGGCTTC
TCGCCGGTGT TCAGCGACGC CGATCTGGCC GAGATCGTCC GGCTGGGTGG TCAAGCGCGG
CAGATGGCCG TGGCCAAGCG TCCCCGCCTG TCGGCCAAGA TCGCCGCCGA GCTGGTCGAG
CAGGGGGGCG AGGAGGTGGT CGCCGCCGTC TGCGCCAACG ACAACGCCCG GATGTCGGAC
ACGATCCTGC AGAAGGTCCT GGATCGCTTC GCCAAGTCCG AGAAGGTGCT GACCGCCGTG
GCCTACCGCG CGGTCCTGCC GCTGGCGGTG ACCGAGCGCC TGATCGACAT GGTCAGCGAC
CAGCTTCGCG ACCATATCCT GGCCCATCAC GCGATCTCGG CCGAGCGCAC GCTCGAGCTG
ATGACCAACA TGACCGAGCG CGCGACCATC GACCTGGTCG AACAGGCCGG TCGTTCCGCC
GATCCCAAGG CCTTCGCCGC CCACCTGCAC AGCGTCGACC GGCTGTCGCC GTCCCTGCTG
CTGCGCGCCC TGGGCCATGG CCACATGACC TTCTTCGAGT GGGGCGTCGC CGAGCTGGCC
GGCGTGCCGC ATCATCGCAC CTGGCTGATG ATCCACGACG CCGGCGCCCT GGGCCTCAAG
GCGATCTGCG AGCGGGCCGG CCTGCCGCCG CGCCTGCTGC CGGCCATCCG CGCCGGCGTC
GACGCCTTCC ACGCCCTGGA ATACGACGGC CGCCCCGGCG ACCGCGAGCG CTTCCAGGAG
CACATGATCC AGCGCTTCCT GACCTCGTCG GCGACGGTGT CGCGCGAGGA CACCGACTAC
CTGCTGGACC GCGTCGACCG CCTGACGGAC TGGGCCCAGG TGGCGGTCGG GTAG
 
Protein sequence
MATTRAALTE HDIRMLVKGA TPDERALAAH KLCRTIDRAE LDAAQRALAA DILRMMAADA 
AELVRRAMAL TLRNSPVLPV DVANRLARDV ESISLPIIGF SPVFSDADLA EIVRLGGQAR
QMAVAKRPRL SAKIAAELVE QGGEEVVAAV CANDNARMSD TILQKVLDRF AKSEKVLTAV
AYRAVLPLAV TERLIDMVSD QLRDHILAHH AISAERTLEL MTNMTERATI DLVEQAGRSA
DPKAFAAHLH SVDRLSPSLL LRALGHGHMT FFEWGVAELA GVPHHRTWLM IHDAGALGLK
AICERAGLPP RLLPAIRAGV DAFHALEYDG RPGDRERFQE HMIQRFLTSS ATVSREDTDY
LLDRVDRLTD WAQVAVG