Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3571 |
Symbol | |
ID | 5901026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3854728 |
End bp | 3855861 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564079 |
Product | hypothetical protein |
Protein accession | YP_001685196 |
Protein GI | 167647533 |
COG category | [S] Function unknown |
COG ID | [COG5330] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.751929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCA CCCGCGCCGC GTTGACCGAA CACGACATCC GCATGTTGGT GAAGGGCGCG ACGCCCGACG AGCGCGCCCT GGCCGCCCAC AAGCTGTGCC GCACGATCGA CCGCGCCGAG CTGGACGCCG CTCAGCGAGC CCTGGCCGCC GACATCCTGC GGATGATGGC CGCCGACGCC GCCGAACTGG TCCGTCGGGC CATGGCCCTG ACCCTGCGCA ATTCGCCGGT CCTGCCGGTC GACGTCGCCA ATCGCCTGGC CCGCGACGTC GAGAGTATCT CCCTGCCGAT CATCGGCTTC TCGCCGGTGT TCAGCGACGC CGATCTGGCC GAGATCGTCC GGCTGGGTGG TCAAGCGCGG CAGATGGCCG TGGCCAAGCG TCCCCGCCTG TCGGCCAAGA TCGCCGCCGA GCTGGTCGAG CAGGGGGGCG AGGAGGTGGT CGCCGCCGTC TGCGCCAACG ACAACGCCCG GATGTCGGAC ACGATCCTGC AGAAGGTCCT GGATCGCTTC GCCAAGTCCG AGAAGGTGCT GACCGCCGTG GCCTACCGCG CGGTCCTGCC GCTGGCGGTG ACCGAGCGCC TGATCGACAT GGTCAGCGAC CAGCTTCGCG ACCATATCCT GGCCCATCAC GCGATCTCGG CCGAGCGCAC GCTCGAGCTG ATGACCAACA TGACCGAGCG CGCGACCATC GACCTGGTCG AACAGGCCGG TCGTTCCGCC GATCCCAAGG CCTTCGCCGC CCACCTGCAC AGCGTCGACC GGCTGTCGCC GTCCCTGCTG CTGCGCGCCC TGGGCCATGG CCACATGACC TTCTTCGAGT GGGGCGTCGC CGAGCTGGCC GGCGTGCCGC ATCATCGCAC CTGGCTGATG ATCCACGACG CCGGCGCCCT GGGCCTCAAG GCGATCTGCG AGCGGGCCGG CCTGCCGCCG CGCCTGCTGC CGGCCATCCG CGCCGGCGTC GACGCCTTCC ACGCCCTGGA ATACGACGGC CGCCCCGGCG ACCGCGAGCG CTTCCAGGAG CACATGATCC AGCGCTTCCT GACCTCGTCG GCGACGGTGT CGCGCGAGGA CACCGACTAC CTGCTGGACC GCGTCGACCG CCTGACGGAC TGGGCCCAGG TGGCGGTCGG GTAG
|
Protein sequence | MATTRAALTE HDIRMLVKGA TPDERALAAH KLCRTIDRAE LDAAQRALAA DILRMMAADA AELVRRAMAL TLRNSPVLPV DVANRLARDV ESISLPIIGF SPVFSDADLA EIVRLGGQAR QMAVAKRPRL SAKIAAELVE QGGEEVVAAV CANDNARMSD TILQKVLDRF AKSEKVLTAV AYRAVLPLAV TERLIDMVSD QLRDHILAHH AISAERTLEL MTNMTERATI DLVEQAGRSA DPKAFAAHLH SVDRLSPSLL LRALGHGHMT FFEWGVAELA GVPHHRTWLM IHDAGALGLK AICERAGLPP RLLPAIRAGV DAFHALEYDG RPGDRERFQE HMIQRFLTSS ATVSREDTDY LLDRVDRLTD WAQVAVG
|
| |