Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_3959 |
Symbol | |
ID | 3936440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 4057674 |
End bp | 4058504 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637906337 |
Product | extracellular solute-binding protein |
Protein accession | YP_511901 |
Protein GI | 89056450 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.406686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTAT CCCGCAGAGT ATTCGCGGCC AGCCTCGGAG TTGCCGCCGC AATGGCAGCC CTGCCCGCCA CAGCCCAAAG CGCGGCGCAA TCGTTGGCCA GCGAAAGCGT CCTGACCACC ATCCAGGAAG AAGGCGTGAT CCGCATCGGC CTGTCCATTT TCACGCCATG GTCCATGCGT GACGTCAACG GAGAGCTGAT CGGCTTCGAG TTGGACGTGG GCCGCGCGCT GGCCGAGGAC ATGGGTGTCG AGGTGGAATT CGTGCCCACC GCCTGGGACG GCATCATCCC CGCGCTTCTG GCGGGCAATT TCGATGTCAT CATCTCGGGC ATGTCGATCA CGCCCCAGCG CAACCTGACG GTCAACTTCA CCGATCCCTA CGCCTATTCC GGCATGGCGA TCCTGGCCAA TACCGCCATG ACCGAAGGCA TGACGATGGA CGATTACAAC TCGCCCGACA TCACCTTCGC CGCCCGCCGT GGGGCCACGC CCGCAACCGT CATCCAGAAC CGCTTCCCCG AGGCTGAGTT GCTGCTGTTC GACGAGGATG GCGCCTCGAC CCAGGAGGTT CTGAACGGCA ATGCTCACGC CACCATGGCG TCCCAGCCCA CGCCGGACCG GGAAGTGCGC CTGAACCCGG AAACGCTGTC AGTGCCCTTT GATGAGTTGA TCGACCCCAC GGGCGAAGCC TTCGCTGTGC GCAAGGGCGA CCCGGACGCG ATGAACTTCT TCAACAACTG GATCGCGGCG CGCACGCGCT CTGGCTGGCT GGAAGAGCGT CACGATTACT GGTTCGTCGG GGACGAATGG GCCGATCAGG TTCCGGAATA A
|
Protein sequence | MTLSRRVFAA SLGVAAAMAA LPATAQSAAQ SLASESVLTT IQEEGVIRIG LSIFTPWSMR DVNGELIGFE LDVGRALAED MGVEVEFVPT AWDGIIPALL AGNFDVIISG MSITPQRNLT VNFTDPYAYS GMAILANTAM TEGMTMDDYN SPDITFAARR GATPATVIQN RFPEAELLLF DEDGASTQEV LNGNAHATMA SQPTPDREVR LNPETLSVPF DELIDPTGEA FAVRKGDPDA MNFFNNWIAA RTRSGWLEER HDYWFVGDEW ADQVPE
|
| |