Gene Information |
Locus tag | Slin_4486 |
Symbol | |
ID | 8728250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5435727 |
End bp | 5437748 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | Endothelin-converting enzyme 1 |
Protein accession | YP_003389265 |
Protein GI | 284039335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
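The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record does end in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the CDS but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Slin_4486.
length = gene_length(5435727, 5437748)
residues = protein_length(length)
print(length, residues)
```

Under these assumptions the computed values agree with the Gene Length (2022 bp) and Protein Length (673 aa) fields of this record.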
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000750073 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.251241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAC GAGTAAGTTT TATAGTAACG GGTGCTCTGG CTATTGCATT TATAGCCGCC GGTCCCCGAA AGTTTATTGA CCCGCAAAAC ATGGATTTAT CCGTGAAGCC CGGCGATAAC TTCTATCAGT ATGCTAACGG GAACTGGCTT CGTCAAAATG CTATTCCAGC TTCCAAAACT TCATGGGGGA GTTTCAATGA GCTTCGCGAC AAGAGTCTGG ATGCGATGAA GACCCTGCTT GAGGATGCGT CCAAAACTAC GACTAAAGGT CGGCTGTATC AGATGGTGGG CGACTATTAC ATGAGTGGTA TGGACAGCCT GACGATTGAA AAGCTTGGTT TTGATCCAAT CAAACCTGAG CTGGCCCGTA TCGAAAAAGT TAACAACAAG GCCACTTTTT TGGACGAATT GGCCTACCAG CGGACACAGA GCAACGGCAT GTTGTTCGGT TTTTTTGTCA GCCAGGACCG TAAGAATGTT TCCAAATATC TACCCCAGTT CAGCCAGGGG GGAACCACGT TGCCCGACCG TGATTATTAC CTGAAAAATG ATACCCGTAG TGTAAAAATC AGGGATGCAT ACCGGGATAA TTTGACCAAA ATGTTCGGCC TGATTGGTGA GGAGCCAACC CAGGCTTCGC AGGATGCCGA TGTGATTATG CGGGTAGAAA CGGCGCTGGC TAAAGCCCAG ATGCCCCGTG TGGAAATGCG CGATCCGTAC AAGACGTATA ACAAACTGAC CATCGCCGAC TTCAACAAGC TAACGCCAGG TATCAACTGG ACTGATCAGC TGACTAAGTT CGGCGCGAAA GGGCAGGATA CGGTACTGGT ACAAAGCCCG GCGTTTTTCC GCTCGCTGGA TAGTCTGGTA GCTGCTACGC CTATCGAAGA TCTGCGGACA TACATGCGCT GGAACATTCT GAAAGGGGCC GCTCCGTTCC TGAGTGATGC GTTTGTAAAA CAAAACTTTG CCTTTTCGAA AGTGCTGACC GGTCAGAAAG AGCAGACGCC CCGCTGGCAG CGTGTGAGTG GACTCATTGA CAACTCACTC GGCGATTTAC TGGGTCAACT GTACGTACAG CGGTACTTCA AACCGGAAGC TAAACAGCGC ATGCTCACAT TGGTCGGTAA CCTCGAAGAT TCATACAAAG AGCACATCAA AAACCTTGAC TGGATGAGCG AGGATACTAA GAAGAAAGCG CTCACCAAAT TGCTGTCCTT CAAACGGAAA ATAGGTTATC CTGACAAGTG GAAGAATTAT GATGGTGTTA CCATTGCCCG CAATGATTAC TACGGCAATG TAAAATCGGC CAGCAAATGG TCGTACAACT ACATGATTAA CCGCCTGGGT AAGCCTGTCG ACAAAACGGA GTGGGGGATG ACACCGCCAA CGGTAAACGC CTACTACAAT CCAGTAAACA ACGAGATTGC GTTCCCGGCG GCTATCCTGC AGTTTCCCTT CTTCGATTTC GACGCGGACG ATGCTATCAA CTACGGTGGT ATCGGGGCTG TAATCGGTCA TGAAATGACG CATGGGTTCG ACGACTCAGG CCGTCAGTAT GATGCCGACG GAACCCTGCG CGACTGGTGG ACCAAAACCG ATGCTGATAA CTTTAAGAAA CGGGCCGATC AGGTAAAAGA GCAGTTCTTC GGTTTTAAGG TATTGGATTC CATCAAAGTG AACGGTCAGC TGACCCTTGG CGAAAACCTC GCCGATCTGG GCGGACTGGC TATTGCCTAT GATGCTTTCA AGAAGACGGC ACAGGGAAAG TCGAGCGGTA AAAAAAGTAT GATTGATGGC 
TTCACACCCG ATCAGCGTTT TTTCCTGTCG TGGGCACAGG TGTGGCGGAT CAATGTTTTA CCCGAAACGC AGGCACAGTT GATCATGACT GATCCCCATG CGCCGGGGAT CTACCGTTGC AACGGACCTC TGGCAAACAT TAATGCATGG TACGAAGCGT TCAATGTGAA GCCGGGCGAC AAAATGTATA AAAAGCCGGA AGACCGAATT AAGGTCTGGT AA
|
Protein sequence | MTKRVSFIVT GALAIAFIAA GPRKFIDPQN MDLSVKPGDN FYQYANGNWL RQNAIPASKT SWGSFNELRD KSLDAMKTLL EDASKTTTKG RLYQMVGDYY MSGMDSLTIE KLGFDPIKPE LARIEKVNNK ATFLDELAYQ RTQSNGMLFG FFVSQDRKNV SKYLPQFSQG GTTLPDRDYY LKNDTRSVKI RDAYRDNLTK MFGLIGEEPT QASQDADVIM RVETALAKAQ MPRVEMRDPY KTYNKLTIAD FNKLTPGINW TDQLTKFGAK GQDTVLVQSP AFFRSLDSLV AATPIEDLRT YMRWNILKGA APFLSDAFVK QNFAFSKVLT GQKEQTPRWQ RVSGLIDNSL GDLLGQLYVQ RYFKPEAKQR MLTLVGNLED SYKEHIKNLD WMSEDTKKKA LTKLLSFKRK IGYPDKWKNY DGVTIARNDY YGNVKSASKW SYNYMINRLG KPVDKTEWGM TPPTVNAYYN PVNNEIAFPA AILQFPFFDF DADDAINYGG IGAVIGHEMT HGFDDSGRQY DADGTLRDWW TKTDADNFKK RADQVKEQFF GFKVLDSIKV NGQLTLGENL ADLGGLAIAY DAFKKTAQGK SSGKKSMIDG FTPDQRFFLS WAQVWRINVL PETQAQLIMT DPHAPGIYRC NGPLANINAW YEAFNVKPGD KMYKKPEDRI KVW
|
| |