Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4078 |
Symbol | |
ID | 5901540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4420820 |
End bp | 4421770 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564599 |
Product | anti-FecI sigma factor, FecR |
Protein accession | YP_001685701 |
Protein GI | 167648038 |
COG category | [P] Inorganic ion transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG3712] Fe2+-dicitrate sensor, membrane component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.53105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGAC CGGGTGATCA AGACAGAGAC GCGCTGATCG CCGAGGCTTC GCTGTGGCTG GCGCGCCTCG ACGCCGGTCG TGCTTCCGAA CAGGACCTGG ACGCCTGGCG CGACGCCGAT CCGCGCCGCG CGGCGGCCTT CGCCGAGGTG GCCAGCGCAT GGACGCGGCT GGACGCCCTG CGCGAGGCCG AGGACCGGCC GCTGCCGAAA CCCAGCCGTC GAGCCTGGCT GGCCGGCGGC GGCGCGGCCT TGGCGGCCAG CGTGGTCGGC GGCGCCTGGC TGGGCCGCGA CATCCTGCTG CGCGACCGCG TCGTCACCGG GGTGGGCGAG CGCCGCACCC TGGCCCTGCC CGACGGCAGC TCGGTCGAGC TCAACACCGA CACCGAGGTC TTCTGGCGGT TCGACCGCAC GCGGCGGCGG CTGTGGCTGT CGCGCGGCGA GGCGGCGTTG ATGATCGTCC ACGACCGGCT GCGGCCGTTC GAGCTGTTCA CATCCCAAGG TTTGGCCCGA TTGGCCGCTG GCCAATTCAA CGCCCGCCTG CGGCCAGCAG GGCTGGACCT GATCGTGCTG GCCGGCGAGG CGGTGGTCGA GACCGCGACG GGGGCGGCCC AGGCCCAGGT GTCCCGCCCG GCCGACGCCC GCCAGGCGCT GGAGGTCACC GCCCAGCGCA TCGCCGTGGT CGCCACGCCC GAGGCCGAGG TCCAGAGCGT CCAGGCCTGG CGGCGCGGCG AGATCGTCTT CGAGGGCCAG GCCCTGTCGG CCGCCGTCGA GGAATATAAC CGCTACCTGA CCCGCAAGCT GGTGATCGGC GACGACAAGG CCGGCCGACT GCGTCTGGGC GGGCGCTTCC TGACCGGCGA CCCCGACAGC TTCCTGGACG CCCTGCGCAC GACCTTCGGT CTACGGATCA TCGACGACGG ATCGTCGCGA ATTCTTCTTA AATCTCGATA G
|
Protein sequence | MARPGDQDRD ALIAEASLWL ARLDAGRASE QDLDAWRDAD PRRAAAFAEV ASAWTRLDAL REAEDRPLPK PSRRAWLAGG GAALAASVVG GAWLGRDILL RDRVVTGVGE RRTLALPDGS SVELNTDTEV FWRFDRTRRR LWLSRGEAAL MIVHDRLRPF ELFTSQGLAR LAAGQFNARL RPAGLDLIVL AGEAVVETAT GAAQAQVSRP ADARQALEVT AQRIAVVATP EAEVQSVQAW RRGEIVFEGQ ALSAAVEEYN RYLTRKLVIG DDKAGRLRLG GRFLTGDPDS FLDALRTTFG LRIIDDGSSR ILLKSR
|
| |