Gene Information |
Locus tag | Slin_5082 |
Symbol | |
ID | 8728847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6213907 |
End bp | 6216285 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | Zn-dependent aminopeptidase |
Protein accession | YP_003389856 |
Protein GI | 284039926 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
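The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Gene Length includes the terminal stop codon; neither convention is stated in the record, although the numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6213907
end_bp = 6216285
gene_length_bp = 2379
protein_length_aa = 792


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(gene_length_bp)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Both checks agree with the table: 6216285 - 6213907 + 1 = 2379 bp, and 2379 / 3 - 1 = 792 aa.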
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.546478 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGA TAGTTTTGCT GTTGGCGAGT TTGGCCATCC CGTTCGGGCT GGTTCAGGCG CAAGCCCCAA CACAGCATGC TAACACGCGC TTTGAGCAGT TAGGGCCTTT GCTTCCTACG CCCAATACGT TCAGAACAGC CTCCGGTGCG CCGGGAAAAG ACTACTTCCA GAATCGAGCT GACTACGACA TTAAAGCAAC GCTCGATGAT ACCAAACAGC ATATAACCGG TTCGGAAACC ATCACTTACC ATAACAACTC GGGCGATGCG CTTCCGTATC TGTGGCTTCA ACTGGACCAG AACCTGTTCC GGCCCGATGC CAACGGTAAT ACCACCGAAA CCAACAGTAT CAACGCTCAG CGGGGTATGA CCGCAGAGCA GCTCGACCCC AGCTCAGTAT TGAAAGGCAA AGATTACGGG CACAAGATCA CGGCGGTGCG CGATCAGGCG GGAAAGGCCC TCAAGTACAC CATCAACCAG ACCATGATGC GCATCGACCT GCCCCAGCCT GTTGGGCCCG GCAAGTCGGT TGTATTTTCT ATTGACTGGA ACTTCAATAT TGTTGATGCG AAAGCTACCC ACGCCCGGTC AGGGTACGAG TATTTCCCAA AGGACGGTAA CTACGTCTAT GAAATTGCTC AATGGTTCCC CCGCCTGTGC GCTTACAGTG ATGTGACAGG CTGGCAGAAC AAACAGTTTC TGGGACAGGG GGAGTTCACC CTCATCTTTG GCAACTACAA GCTGGCCCTT ACCGTCCCCA ACGACCACAT CGTGGGCGCT ACGGGCGAGT TGCAGAACCC GGCCCAGGTG CTGACCCCTA TCCAGATCAA ACGCTGGAAC GAAGCCAAGG GCAAGGGTGA CAAGCCCGGC GAAAACCCCA CCGTCATCGT GACACAAGCC GAAGCTGAAG CGGCCGAAAA AGGTAAGCCG ACAGGCACCA AGACATGGGT ATTCAAGGCC GATAACGTCC GCGATTTCGC GTTTTCCAGC AGTCGTAAGT TCATCTGGGA TGCGCTGAAC CCCAACGTAG AAGGCAAGCG GGTATGGGCC ATGTCACTTT ACCCTAAAGA AGCGAATCCA CTCTGGGGTC AATACTCGAC CCGGCTGGTA GCGCACACAT TACGTTCGTA CTCTCGCCGG ACCATCGCTT ACCCCTATCC GGTGGCCTAC TCGGTTCACG GACCCGTGGG CGGCATGGAG TACCCCATGA TGTGCTTCAA CGGTGCCCGC CCCGAAGAAG ACGGAACGTA TTCGGAAGGC ACTAAAAACT TTTTGATTTT GGTCGTTATC CACGAAGTGG GTCACAACTT TTTTCCCATG ATCGTCAACT CCGACGAGCG GCAGTGGTCC TGGATGGACG AAGGACTCAA CAGCTTCCTT GAAGGGGTAA CCTGTTTAGA GTGGGACGCC AACTTCCCGG CGCGGGGCAT TCAACCCCAA TCGATCGTGT CTTACATGCG ACTTGATTCG ACCCAGCAGG TGCCCATCAT GAGCAGTTCG GATAATATTT TACCGAATAC CTTCGGGCCA AATGCTTATG ACAAACCCGC TACGGCGCTG AACATCCTGC GCGAGACGGT CATGGGCCGC AAGCTGTTCG ACTACGCCTT CAAAGAGTAC GCCCGTCGAT GGGCCTTCAA GTCCCCCGAA CCCGCCGACT TCTTCCGAAC CATGGAAGAT GCCTCCGGCG TTGACCTCGA CTGGTTCTGG AAAGGCTGGT TCTACGGCGT GCAGCCCGTC GACCAGGCAC TGGTCAAAGT CGACTGGTTC CAGGCCGGGT CACAAAACCC CGAGATCGCC 
AAGGCCGAAG CCCGGGCAGC CGCAGCCAAA CGTGCCAACA CCATCAGCAA GCAACGCGAT GCCGCCACCA AAGGCCAGAC CGTCGTCGCC CAGGACTCCA CCATGAAGGA CTTCTACAAC AGCTACGATC CCTACGCCGT TACCGAGGAA GACAAGAAGA AGTACCAGGA CTACCTGGCC ACACTCACCC CTGAGGAGCG CAAACTGGCC GAAGCAGGCA CCAACTTCTA CACCCTCTCG CTCAAGAACA AGGGCGGCAT CCCCATGCCG GTGATCGTGC GCATGGAGTT CGAGGATGGC ACCGACTCGG TGGCGCGCTT CCCGGCCGAG ATCTGGCGCT TCAACGACGT GTCGATCAAC AAGGTGATTG CCACCAGCAA GAAAGTGAAG CAGTGGACGC TGGACCCTTA CTACGAGATT GCCGACATCA ACACGGAGGA CAACAGCTTC CCGCCGGTGG CTCAGCCGAC GCGGTTCCAG TTGTTCAAAC AGCAGCAGCG GGGCGGTGGG GCCGCGCCTA ACCCCATGCA GCAGCAACGT CAGCAACAAC CCGCCAAACA AGGCACCGGT CGAAATTAA
|
Protein sequence | MQKIVLLLAS LAIPFGLVQA QAPTQHANTR FEQLGPLLPT PNTFRTASGA PGKDYFQNRA DYDIKATLDD TKQHITGSET ITYHNNSGDA LPYLWLQLDQ NLFRPDANGN TTETNSINAQ RGMTAEQLDP SSVLKGKDYG HKITAVRDQA GKALKYTINQ TMMRIDLPQP VGPGKSVVFS IDWNFNIVDA KATHARSGYE YFPKDGNYVY EIAQWFPRLC AYSDVTGWQN KQFLGQGEFT LIFGNYKLAL TVPNDHIVGA TGELQNPAQV LTPIQIKRWN EAKGKGDKPG ENPTVIVTQA EAEAAEKGKP TGTKTWVFKA DNVRDFAFSS SRKFIWDALN PNVEGKRVWA MSLYPKEANP LWGQYSTRLV AHTLRSYSRR TIAYPYPVAY SVHGPVGGME YPMMCFNGAR PEEDGTYSEG TKNFLILVVI HEVGHNFFPM IVNSDERQWS WMDEGLNSFL EGVTCLEWDA NFPARGIQPQ SIVSYMRLDS TQQVPIMSSS DNILPNTFGP NAYDKPATAL NILRETVMGR KLFDYAFKEY ARRWAFKSPE PADFFRTMED ASGVDLDWFW KGWFYGVQPV DQALVKVDWF QAGSQNPEIA KAEARAAAAK RANTISKQRD AATKGQTVVA QDSTMKDFYN SYDPYAVTEE DKKKYQDYLA TLTPEERKLA EAGTNFYTLS LKNKGGIPMP VIVRMEFEDG TDSVARFPAE IWRFNDVSIN KVIATSKKVK QWTLDPYYEI ADINTEDNSF PPVAQPTRFQ LFKQQQRGGG AAPNPMQQQR QQQPAKQGTG RN
|
| |
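The relationship between the Gene sequence and Protein sequence fields can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record. It assumes the sequences are stored in whitespace-separated blocks as shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position
# (the usual NCBI layout); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field.
first_blocks = "ATGCAAAAGA TAGTTTTGCT GTTGGCGAGT"
print(translate(first_blocks))  # MQKIVLLLAS
```

The ten residues printed match the first block of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content listed in the record.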