Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0589 |
Symbol | |
ID | 4026248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 654895 |
End bp | 657045 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965757 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_572650 |
Protein GI | 92112722 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.115541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGG AAATCGAAAC CCAGCGCAAT TTCAATCCCG GCACCAACGA ACCGTTCTCG TCGGTGCTGG CTCGACATGT CTCGCGACGC GACATCATGC GGGGAGGCCT GAGTCTGGCC GCTCTGGGCA TGTTGGGCGG CATCGGCGGG CTCAACCTCG CCAATGCCCA GGAAAGCGGG AGCGGCAAGA CGCCGCTGGC CCTGGCGTTC GAGTCGCTGC GCGGCTCGCG CACCGATGCC GTGGTGGTGC CCGAGGGCTA TACCGCTCAG GTCCTGGTCC CCTGGGGAAC GCCGCTGAAC GGCGCGGCGG CGGAGTGGAA GCACGACATG CGCATGACCG CCGAGGCGCA GGCCGAGCGC GTGGGCATGC ATCACGATGG CATGCATGCC TTTGCGCTCG ATGCGGACAA TGCCTCGCGG GATTTCCTGC TCGCCCTCAA CAACGAATAC ATCGATCAGG CGGCATTGTG GGCGCCGCAG GGCGGGCCGA CGAATGCCGA CAGCGGCAGG CGGCCGGCCG ATGAGGTGCG CACCGAGATC AACGCGCACG GGATCACCCT GGTCAGGGTA CGCAAGGACG CCGAGGGGCG CTGGACGCAT GTGCCGGACG ATGCCCGCAA CCGGCGCTTC ACCAGTGCCA CGCCGATGGA GCTGGCGGGC CCCGTGGCCG GCAGCGATTA CGTCAGGACG CGTTATTCCC CGGAAGGCAC GCATACACGC GGCACCAACA ACAACTGTGC CAACGGGTAT ACCCCATGGG GCACCTACCT GACCTGCGAG GAAAACTGGC CGGAGGTGTT CGTCAATACC GGCGAGCGCC ACGCCGACGA TGCGCGCCTG GGGCTGCCCA CCGAGCGCGG GCGCTACGGC TGGGAGACCG CGGCCGGCAG CGAGGGCGAG CGCGACGACG AGTTCGCGCG ATTCGATGTC ACCCCGCGTG GCAACGGGCC GCGGGACGAC TATCGCAACG AGACGCGCAC CTTCGGCTAT ATCGTGGAGA TCGATCCGTA CGATGGCCAG TCCCGCGCCG TGAAGCGCAC CGCGCTGGGA CGCTTTCGCC ACGAGGGCTG CTGGCCGGGC AAGCTGGTCG CCGGCGAGCC GGTCGTCTTC TATTCCGGGC ATGACGCCCG CAACGAGTAC ATCTACAAGT TCGTGTCCGA CGCCAAGTGG GACCCGGCGG ATGCCAACCG CCCCGGCGAG GCATATGACC GTCTCGCGCT GGGCGCCAAG TACATGGACG AGGGAACGCT TTACGTGGCG CGTTTCGATG CCGATGGCGG CGGGGAATGG CTGGCGCTCG AGCCGGACAC GCGCGTGGCG GACGGCCGCA CGCTGGCCGA GGCCTTGTCG CTGGCGGAAG ACGACCGCGC GGGCGTCATC GTGCATACCT GCGACGCCGC CGACCTGCTC GGCGCGACAC CCATGGACCG TCCCGAGTGG GGGACCGTGG ACCCGAGCTC CGGCGAGGTC TACATGACGT TGACCAACAA TTCCCGGCGC ACCGAGGCCG ATTCGGCACC GACCTTCACC AATGAAGGCG ACGCCATCGA GGCGGCAGGC GTGGGTTATG CCACGGCCCC TGCCAATGCC GCCAACCCGC GCGCCGACAA CGAGGCGGGG CAGGTGATCC GCTGGCGGGA GCGGGGCGAC GACGCCACCC GTTTCGACTG GGAGGTGTTC GTGTTCGGCG CCGCCGCAGA CGATGCCGAC AACCTGTCGG GGCTGACCGA GCGCAACCAG TTCGCCAGCC CCGATGGCCT GTGGTACGAC GATCGCGGTG ACGGCCAGGG CATTCTGTGG ATCGAGACCG ACAATGGCTA CGACGGTGTC GCCGAGCAGA CCAACGACCA GGTGCTGGCG GTGGTACCGG CGGCTCTGTC ACGCGAGAGC GGCCAGGCGT CGGTGGTGGG GGCCGCCAAC CAGCAGCAAC TCAAGCGGTT TGCGGTGGGC CCCAACGATT GCGAGGTGAC CGGCATCTTC GCCACGCCGG ACAAGACGGC GCTGTTCATC AACATCCAGC ATCCCGGCAA TTGGCCGGCG GATCCCGATG CGGTGACGCA GGATGCCACG AAGGTTGCCG CCGGTAGCGT GCGCCCGCGG GCCGCCACGG TCGTGATCCA GAAAGCCGAC GGTGGGCAGA TCGGCGTGTA G
|
Protein sequence | MSKEIETQRN FNPGTNEPFS SVLARHVSRR DIMRGGLSLA ALGMLGGIGG LNLANAQESG SGKTPLALAF ESLRGSRTDA VVVPEGYTAQ VLVPWGTPLN GAAAEWKHDM RMTAEAQAER VGMHHDGMHA FALDADNASR DFLLALNNEY IDQAALWAPQ GGPTNADSGR RPADEVRTEI NAHGITLVRV RKDAEGRWTH VPDDARNRRF TSATPMELAG PVAGSDYVRT RYSPEGTHTR GTNNNCANGY TPWGTYLTCE ENWPEVFVNT GERHADDARL GLPTERGRYG WETAAGSEGE RDDEFARFDV TPRGNGPRDD YRNETRTFGY IVEIDPYDGQ SRAVKRTALG RFRHEGCWPG KLVAGEPVVF YSGHDARNEY IYKFVSDAKW DPADANRPGE AYDRLALGAK YMDEGTLYVA RFDADGGGEW LALEPDTRVA DGRTLAEALS LAEDDRAGVI VHTCDAADLL GATPMDRPEW GTVDPSSGEV YMTLTNNSRR TEADSAPTFT NEGDAIEAAG VGYATAPANA ANPRADNEAG QVIRWRERGD DATRFDWEVF VFGAAADDAD NLSGLTERNQ FASPDGLWYD DRGDGQGILW IETDNGYDGV AEQTNDQVLA VVPAALSRES GQASVVGAAN QQQLKRFAVG PNDCEVTGIF ATPDKTALFI NIQHPGNWPA DPDAVTQDAT KVAAGSVRPR AATVVIQKAD GGQIGV
|
| |