Gene Information |
Locus tag | Caul_1390 |
Symbol | |
ID | 5898845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1476940 |
End bp | 1478769 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641561877 |
Product | hypothetical protein |
Protein accession | YP_001683018 |
Protein GI | 167645355 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
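The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1476940, 1478769))  # (1830, 609)
```

The result matches the Gene Length (1830 bp) and Protein Length (609 aa) fields.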
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.342186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTAAACC GCCGCCAAAT GCTGCTGTCC ACCGTCGCCG TCGGCCTGGC CGCCGCGACC CCCGGCCTTG CCCTGGCTCA GGCCGCGCCG GCCTCTGGCG AGAGCGCCAA GCTCAACGCC TTGTTCGACG CCATCATGGC CAAGCAACTG CGCCAGTCGC CCGAGACGGC GACCGGCCTG GGCCTGGATG TCGGCGACCT GGCCTGGACC AAGTCGCTGC TGGCCGACCG CTCGTTCGCC GCCATCGACG CCGGCGTCGC CCAGACCAAG CAGCAACTGG CCGACCTGCG CGCCATCGAC CGCAAGGCGC TGACCGGCAT GGACGTGGTC AATTACGACA CCGTCGAGTT CACCCTGGCC GTGCAGGACG AGGGCAATTC CAAGTTTACC TATGCCGGCG GCGGTTCGGG CGCGCCCTAC GTGCTGAGCC AACTGACGGG TTCGTACCAG TCCTATCCCG ACTTCCTCGA CACCCAGCAT TCGATCGAGA CCAAGGCCGA CGCCGACGCC TACCTGTCGC GCATGGAGGC CTTCGCGCGG CTGATGGACC AGGAGTCGGC CATCGCCAAG CGCGACTACG CCGACGGCGT GATCCCGCCG GACTTCATCC TCGACAAGGC CCTGGTCCAG ATGAAGGCCT TCGCCGGCAC GCCCACCGCC GAGGCGCCGC TGGTGCTGTC GGTGGCCCGC CGGACCAAGG AAAAGAACAT CCCCGGCGAC TGGGCCGGCG AGGCGAGCAA GATCTACGAG ACCTCGGTGC TGCCCGCCCT GAAGCGTCAG ATCGCCCTGC TGGAGAGCGT GCGGCCCAAG GCTGTCCACG ACGCCGGCGT CTGGCGCCTG AAGGACGGCG GCGACTATTA CGCCGTGTCG CTGAAGAACT ACACGACCTC GACCCTGACC CCCGACGAGA TCCACCAACT GGGCCTGGAC CTGGTCAAGT CGATCTCAGC CCAGGCCGAC AAGCTGTTCA AGAGCATCGG CATGTCCAAG GGCACCGTCG GCGAGCGGAT GGCCGAGCTG GGCAAGGACA TCTACGAGAA CACCGATCCG GCCAAGGAAC AGCTGATCGC CGACCTCAAC ACCAAGGCCA AGTGGATCGA GAAGCAGCTG CCGGCCTATT TCGGCCAGCT GCCCAAGGCT CCGCTGGAGA TCCGTCGCGT GCCCAAGGCC ATCGAGGCCG GCGCGCCGGG CGGCTACTAC AATTCGCCCT CGCTGGATGG GAAGCGGCCG GGGATCTACT GGATCAACCT GCGCGACACC AAGGAGCAGG CCAAGTACAC CCTGACCACC CTGACGGTGC ACGAGGGCGT CCCCGGCCAC CACCTGCAGC TGTCGCTGTC GAACGAGGCG CAAGGCCTGC CCCTGCTGCG CAAGACGATC GGCAACTCGG GCTATGCCGA GGGCTGGGCG CTCTATGCCG AAGAGCTGGC CGTCGAGATG GGCATCTACA AGGCCGATCC GCGCGGCCAC ATCGGCATGC TGCACGACGC CCTGTTCCGC GCCGTGCGCC TCGTGGTCGA CAGCGGCATG CACTACAAGA AGTGGAGCCG CGAGCAGGCC GTGAAGTACA TGGCCGAGAC CATGGGCGAC GAGGAAAGCG GCACGATCAC CGAGATCGAA CGCTACTGCG TCTGGCCGGG CCAGGCCTGC AGCTACATGA TCGGCAAGAT CACCTGGCTG CGGGCCCGCG AGCGGGCCAG GAAGGCGCTG GGCAAGAGGT TCGACATCCG CAAGTTCCAC GACGCGGGCA TTCTGGCGGG CATGACTCCG CTCACCGTGC TGGACAAGGT GATCGACAAC 
TACATCGCCG AGACCAAGGC GGCGAAGTGA
|
Protein sequence | MLNRRQMLLS TVAVGLAAAT PGLALAQAAP ASGESAKLNA LFDAIMAKQL RQSPETATGL GLDVGDLAWT KSLLADRSFA AIDAGVAQTK QQLADLRAID RKALTGMDVV NYDTVEFTLA VQDEGNSKFT YAGGGSGAPY VLSQLTGSYQ SYPDFLDTQH SIETKADADA YLSRMEAFAR LMDQESAIAK RDYADGVIPP DFILDKALVQ MKAFAGTPTA EAPLVLSVAR RTKEKNIPGD WAGEASKIYE TSVLPALKRQ IALLESVRPK AVHDAGVWRL KDGGDYYAVS LKNYTTSTLT PDEIHQLGLD LVKSISAQAD KLFKSIGMSK GTVGERMAEL GKDIYENTDP AKEQLIADLN TKAKWIEKQL PAYFGQLPKA PLEIRRVPKA IEAGAPGGYY NSPSLDGKRP GIYWINLRDT KEQAKYTLTT LTVHEGVPGH HLQLSLSNEA QGLPLLRKTI GNSGYAEGWA LYAEELAVEM GIYKADPRGH IGMLHDALFR AVRLVVDSGM HYKKWSREQA VKYMAETMGD EESGTITEIE RYCVWPGQAC SYMIGKITWL RARERARKAL GKRFDIRKFH DAGILAGMTP LTVLDKVIDN YIAETKAAK
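The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted initiation codon and is read as methionine at the start position. The following is a minimal sketch of that rule, not the pipeline used to produce this record; `translate_cds` and the constant names are hypothetical, and the table is built by hand rather than taken from a library.

```python
# Codon table for translation table 11; bases are ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(dna.split()).upper()    # drop the 10-base spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"                      # initiator is always Met
        if aa == "*":
            break                         # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCTAAACC GCCGCCAAAT GCTGCTGTCC"))  # MLNRRQMLLS
```

The output agrees with the first ten residues of the protein sequence listed above.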