Gene Information |
Locus tag | Slin_4568 |
Symbol | |
ID | 8728332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5540057 |
End bp | 5541397 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | PARP catalytic domain protein |
Protein accession | YP_003389346 |
Protein GI | 284039416 |
COG category | |
COG ID | |
TIGRFAM ID | |
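
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 5540057
end_bp = 5541397
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the table is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 5541397 - 5540057 + 1 gives 1341 bp, and 1341 / 3 - 1 gives 446 aa, matching the Gene Length and Protein Length rows.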
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0566716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCATGC TAAATCAGTT GTTACAACGG GTGTGGGGGC AGTCTGCACC AGCCACCCTC CTTCAAACCT CCCGCCCTGT ATCGTCTACA GCCGAACCCG TTTTGGCTTC GCAGCCAATC CGGCACCTGC GTACCATGAA GCTCATGATG GTAACGGCCG AGAACAACAA CAAATACTAT GAAATGCGGG AGAAGGAAAA TGGCACCTTC GAAGCACACT ATGGCCGCGT AGGTGGTTTC CGAAGCAAAT CGGTTCACCC AATGGCGCAG TGGGAACGGA AAGTCCGCGA GAAAAAATCG AAGGGCTACA CCGATCAGAC CCACCTCTTT GCCGAAAGCC GACCTGAAAC AGATGCCAGT ACCATTGCCA ATCCAGAGGT TCGAAACCTG ATGGCACGAC TGCTGGAATT AGCCAGACAG TCCATCTTTC AGAACTATGT CGTTACGGCT CAGGAGGTAA CGCGTAAACA GGTCGAACAC GCTCAGCAAC TGCTCGACGA GCTGGCCAAC CTTCTGGCGG TCAACATGGA TACAGCGGCC TACAATGCCA AATTGCTGGA GCTGTTCAAG GTGATACCCC GCCGAATGAG CAAGGTAGGG GAGCACTTGG CAGGCGCATC GCCCCAGTCG GCCGAGGAAC TACAACCCCT TCGCGACCAC CTCGCCGAAG AGCAGTCTAC CCTCGATGTG ATGCGCGGTC AGGTTGAACT GGCCCCCGAA TCAACCCCCG ATGACCAGCC CCAGCCAACG CTGCTGGACA GCCTGAACCT GGCTATCGAG CCGGTGACCG ATGCGCATAT CATTACCCTG ATCAAGCGCA TGATGGGTAC CGATGCCGTC AAATTCGACG CAGCGTTCAG TGTCCGCCAT ACGGCTACCG ATGCCGCTTT CGATGCCTAT GTGCGTCAAC AGAAGAATCG AAAAACCATG GCGCTCTGGC ACGGAAGCCG CAGCGAGAAC TGGCTGTCTA TTTTGAAAAC GGGGCTGTTG CTGCGTCCGG CCAATGCCGT TATTACGGGC AAGATGTTTG GCTACGGCAT CTACTTCGCC GACCAGTTCA GCAAATCGCT CAACTACACC TCACTGAACG GCTCGGTATG GGCCAACGGT CGGCAGTCGG AAGGGTATCT GGCTATTTAT GAAGTACATG TCGGCGAACA ACTGGAGCTG ACAAAACACG AACCCAGCCA CATGCAACTG GACATTAACG CCCTGAAGCA ACTTGACCCA CAGTATGATT CCGTATTTGC CCGGCAGGGG GTCAGCCTGC AGAAGAACGA ATTCATTGTA TACAACCCGG CGCAGTGTAC GGTTCGGTAC ATTGTCAAAA TCAAGGCTTA A
Protein sequence | MTMLNQLLQR VWGQSAPATL LQTSRPVSST AEPVLASQPI RHLRTMKLMM VTAENNNKYY EMREKENGTF EAHYGRVGGF RSKSVHPMAQ WERKVREKKS KGYTDQTHLF AESRPETDAS TIANPEVRNL MARLLELARQ SIFQNYVVTA QEVTRKQVEH AQQLLDELAN LLAVNMDTAA YNAKLLELFK VIPRRMSKVG EHLAGASPQS AEELQPLRDH LAEEQSTLDV MRGQVELAPE STPDDQPQPT LLDSLNLAIE PVTDAHIITL IKRMMGTDAV KFDAAFSVRH TATDAAFDAY VRQQKNRKTM ALWHGSRSEN WLSILKTGLL LRPANAVITG KMFGYGIYFA DQFSKSLNYT SLNGSVWANG RQSEGYLAIY EVHVGEQLEL TKHEPSHMQL DINALKQLDP QYDSVFARQG VSLQKNEFIV YNPAQCTVRY IVKIKA
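
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before measuring length or base composition. The sketch below is an illustration only. The helper names are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the full sequence length. Only short prefixes of the sequences are embedded here; the full strings from the Gene sequence and Protein sequence rows can be passed in the same way.

```python
def strip_blocks(seq: str) -> str:
    """Remove the block spacing and line breaks, returning uppercase letters."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases (assumed definition of 'GC content')."""
    clean = strip_blocks(seq)
    if not clean:
        raise ValueError("empty sequence")
    gc = sum(1 for base in clean if base in "GC")
    return 100.0 * gc / len(clean)


# First blocks of the gene and protein sequences from the rows above.
gene_prefix = "ATGACCATGC TAAATCAGTT GTTACAACGG"
protein_prefix = "MTMLNQLLQR VWGQSAPATL"

print(len(strip_blocks(gene_prefix)), len(strip_blocks(protein_prefix)))
print(round(gc_percent(gene_prefix), 1))
```

Applied to the complete gene sequence, `len(strip_blocks(...))` can be compared with the Gene Length row and `gc_percent(...)` with the GC content row.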