Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2373 |
Symbol | |
ID | 5899828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2577858 |
End bp | 2579558 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641562864 |
Product | hypothetical protein |
Protein accession | YP_001683998 |
Protein GI | 167646335 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCA ACCTCGCCTT GGCCGAACTG GCTGCGCGCT ATTGGGCGTT TCAATGCGAA GAGTTCCCGA TCAACGCGAT CGCGGCTGGG GCGGCGACCA CAGCCTCACA GCTGATGCGG GAGGCGCCGG CGGATCATGA GCGGCGCGCC GCTTGGGCGC GGACCGCCCG CGACGCGCTC TTGGCGATCG ACGTCGGATC GCTCGAGATT GACGACACCG CGACACATCA ACTGCTCGAT CATGAGCTTC GTCTCACGAT CGAACTGGTC GAAAGCGGCG CACACTTGCG CCCCACGATC TATCCGCTCG GCCCGGAATT CACACTGATC TACTGGGCGA ATTCGACGGC CCTGGCCACC GCGACCGATG CGCGACTTTA TCTGGCCAGG CTCGCCGCGA TACCTGCATC GTTCGAGACC GTTCAGGCGG GATTGGCCCA AGGGGTGGCC CAAGGGATGT CCTATCCGCG CCTCGTCGTG GAACGTGCCG TCGCGCAGGT CCGCGGACAG ATCTCGGCGG CTCTGGAGGC GGACCCCTTC TACAGTCCTC TGGGCCGGGC GGCAGCCCGG GGGGGCGTGA TGGAGGATTT GGCGGGCGAG GGGCGTGCAC TGGTGGAAGA GGTCGTGCGA CCCGCCTTTC TCGCCTACGC GGACTTTCTC GAAAGCACGG TGCTGCCGGT ATCGCGCGAG AGCATCTCGG GCGCCGACGA CGTCGATGGC GAGCGTTTCT ACCGCTACAA TATCAACCAA TATACGACGG TGGATCTACC GCCCGAGGCC ATCCACGCCA CCGGGCTGGC GGAGGTCCAG CGTCTCAAAG GCGAGATGCA GGCCGTCGCC AGCGATGCGG GCTTCCCCAA TGACATCGAA GGCTTCCGTG ACCGCCTGAA GACCGACAAC CGGCAATTCG CCGAAAGTGG GGAAGCATTG CGCGAGCAGA TCGAGATTCT GTCAAAACGC ATCGATGCGA GGATCCCGGA ATTCTTCGGG CGAATACCCC GCATCAGCTA CGGCGTGAGC AGCATTCCCG AAGCCATCGC CGAGAGAATG CCTCCGGCCT ACGCCCAGCC CAATCCGGCC GACGGCAGCG CGGCGGGCGT CCACTGGATC ACGTCGATCC CCAGCAAATG TCCAAGCTAC ATGCACTTGC CCTTGGCGCT GCACGAGGCC TGGCCCGGTC ATCTGATGCA TCTCGCCTTG ATCCAGGAGA TGGATCAACT TCCCGACTTC CGCCGCTACG GGGCCATGAA ATACTCCGCC TGCCTTGAAG GCTGGGCGCT TTATTGCGAG GCGTTGGGCG AAGACATGGG CTTTTACGAT ACGCCGGAGA AGCGGTACGG ACGCCTAGAG ATGGAGATGT GGCGCGCGGT GCGGCTGGTC GTGGACACCG GAATTCATTC TGGAGAATGG AGCCGCGATC AGGCTATTTC CTTCTTCCAG GACAATATGG CGATGCCGCT CGAGACGATA ACGGCCGAGG TCGATCGCTA CATCGGTTTG CCTGGGCAGG CGCTCGCCTA TCAGCTCGGC AATCTCAAGT TTCGCGAGCT TCGCGCCCGC GCGCAGGCGG CTCTCGGCGA GGATTTTCGG ATCCGCGATT TTCACGACGC CCTGATGGCG GCCGGCGCCG TGACGCTGCC TGTGCTTGAG ATGCTGATGG ACGACTGGAT CGCCGACGCG AAGGTTGCCG TGGCCGCATG A
|
Protein sequence | MDTNLALAEL AARYWAFQCE EFPINAIAAG AATTASQLMR EAPADHERRA AWARTARDAL LAIDVGSLEI DDTATHQLLD HELRLTIELV ESGAHLRPTI YPLGPEFTLI YWANSTALAT ATDARLYLAR LAAIPASFET VQAGLAQGVA QGMSYPRLVV ERAVAQVRGQ ISAALEADPF YSPLGRAAAR GGVMEDLAGE GRALVEEVVR PAFLAYADFL ESTVLPVSRE SISGADDVDG ERFYRYNINQ YTTVDLPPEA IHATGLAEVQ RLKGEMQAVA SDAGFPNDIE GFRDRLKTDN RQFAESGEAL REQIEILSKR IDARIPEFFG RIPRISYGVS SIPEAIAERM PPAYAQPNPA DGSAAGVHWI TSIPSKCPSY MHLPLALHEA WPGHLMHLAL IQEMDQLPDF RRYGAMKYSA CLEGWALYCE ALGEDMGFYD TPEKRYGRLE MEMWRAVRLV VDTGIHSGEW SRDQAISFFQ DNMAMPLETI TAEVDRYIGL PGQALAYQLG NLKFRELRAR AQAALGEDFR IRDFHDALMA AGAVTLPVLE MLMDDWIADA KVAVAA
|
| |