Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4413 |
Symbol | |
ID | 5060899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 4994870 |
End bp | 4996162 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640476676 |
Product | hypothetical protein |
Protein accession | YP_001161219 |
Protein GI | 145596922 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.788807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACGA ACAGGAACCG GTGTAAGCAG CCGCGACGCC GCACGACCAC TGTGGTGACA GCGCTGCTCG GAGCTGCCCT GCTTGGATTT ACCGCCACCC CCATGCCGGT GTCGGCGAAC GAGGCAACCC TGCCCCTCGG CGACTCCGAC CTGGTGGAGA CTCGCAGCAG CGAGACGCTC GCCCAGGGCG TCACGCTGAC CCGGATCGTC CGCGGCACCG AGCCCGCACC GATCGATCAG ATCGGCGACA CTCCCCGTGG CCCTTGGGTG GTCAATGTCC TGACGATCGA CCCCACGCAG AGCAAGGGGC ACCTCGCCGC CACCTACGGA CCGGACCTGG CCGGAGTCGA AAAGACCACC GACCTCGTCC GCGAGGCTGA CGCGCTGGTC GGGGTCAACG CCTCCTTCTT CACCTTCACC GCCAGCGCCG AGTACCCCGG CGACCCTGTT GGCCTGGGGA TATACGGCGG GCGGCTCCTC AGCGAGCCCA CCGGTGACGC CGCGGAGGCA GACCTGGTCC TCGATGCCAA GAACCACAAG GTGCTCATGG GCCAGTTGCG GTGGACCGGT AGCGTCCGAA ACGCCCAGAC CAAGATCAGC CTTCCACTGG AGTACATCAA CCACCCGCCG GTTGTCCCGG ACCCCTGCAC TGATCTTCCA GACCCGACCC AGTGCGCCGA CTCCGGCGAC ACGGTGCGAT TCACGCCGGA GTTCGCCGCC AGCACGCCGT CCGGGCCGGG CGTCGAGGTC GTCCTCGACA GCAAGGGCTG TGTGGTCCGG ACCACGACCA CCCGCGGTAC GACTCTCGCC CAGGACCAGA CCTCCCTACA GGCGACCGGG CGGGAAGCGG CGGATCTGCT CGCCGTGGCG CAGGGCTGCG TGAAGCACTC CAGCTCCCTG CAGGACGAGG CGGGTAAGAA GATTCCGCTG CGCCCCGGGA CGTTCGCCGT GAACGGCCGC TACCGGCTGG TGAAGGACGG GCAGATCGTC GCGCCATCCG GCTCGGACAG CTTCTTCGAT CGCCACCCGC GCACCATCGC CGGAACCACC CTCGACGGCA AGATTGTGCT CGTGACCATC GATGGCCGGC AGACCACCAG CGTTGGCACC ACCATGACCG AGACCGCCTC GGTCGCCGCC GCACTCGGCA TGCACGACGC GGTCAATCTG GATGGCGGTG GATCAACCAC AATGTCGGTC GAAGGCTCCC TGGTCAACCA GCCGAGCGGC AACGAGGAGC GGCCCGTCGG CGACGCGCTC GTCTACATCG ATCGCACGTT CGACGATCGC TGA
|
Protein sequence | MTTNRNRCKQ PRRRTTTVVT ALLGAALLGF TATPMPVSAN EATLPLGDSD LVETRSSETL AQGVTLTRIV RGTEPAPIDQ IGDTPRGPWV VNVLTIDPTQ SKGHLAATYG PDLAGVEKTT DLVREADALV GVNASFFTFT ASAEYPGDPV GLGIYGGRLL SEPTGDAAEA DLVLDAKNHK VLMGQLRWTG SVRNAQTKIS LPLEYINHPP VVPDPCTDLP DPTQCADSGD TVRFTPEFAA STPSGPGVEV VLDSKGCVVR TTTTRGTTLA QDQTSLQATG REAADLLAVA QGCVKHSSSL QDEAGKKIPL RPGTFAVNGR YRLVKDGQIV APSGSDSFFD RHPRTIAGTT LDGKIVLVTI DGRQTTSVGT TMTETASVAA ALGMHDAVNL DGGGSTTMSV EGSLVNQPSG NEERPVGDAL VYIDRTFDDR
|
| |