Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1996 |
Symbol | |
ID | 8725734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2409711 |
End bp | 2410997 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | FolC bifunctional protein |
Protein accession | YP_003386840 |
Protein GI | 284036910 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0734572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0222652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACA CGGAAGCAAT CGACTATTTA TATAGTCGGC TCCCAGTTTT TCATCGAATT GGCCCCAAAG CCATAAAGCC GGGTTTAGGA AATACCTTAT TACTGTGCGA AGGGCTTGGA AACCCGCATC AGCAGTTTAC CAGCATTCAC GTTGCCGGAA CAAACGGGAA GGGGAGTACC TCCCATATGC TTGCAGCCAT TTACCAGTCG GCAGGGTACC GCGTTGGCCT ATATACGTCA CCCCATCTTA AATCGTTTAC AGAGCGGATA CGACTAAATG GACGGCCTAT CCCGGAAGAG GAAGTTGTTC GTTTTGTAGA GCAGCAGCAA CCGTTGATAG AATCGGTTGA GCCTTCTTTT TTCGAAGTAA CGGTTGCCAT GGCCTTCTAT TTCTTTGCTC GTCACGCCGT TGACATAGCC ATTATTGAAG TTGGCCTGGG AGGGCGTCTC GATTCTACCA ATGTAATCAC TCCTATTGCT TCGGTTATTA CCAATATAGG CTATGATCAT ACCGATATAC TGGGGGATAC GCTCCCGCTG ATAGCCGCCG AGAAAGCGGG TATTATTAAA CCAGGGGTTC CGGTTATTAT TGGTGAGTCA CATCCAGAAA CACAGGAGGT ATTTACATCC GTATCGGCAT CGCTTCAAGC CCCTATAACC TTTGCTGATC GACAGTATCT GGTAAATGAT TTAGGTTTGG TTGACGGAAT TCGGCAGGCC TCTATAAGCC GTGGTGATGG GTCTGGCTGG CTTGCTCAAC TCGACCTATT GGGAGCTTAC CAACTTAAGA ACCTCCCCGG TGTTTTTGCA ACTGTTGAAC AATTGCAACA GCAGTTCCCC GTTACAGCGG CTCAACAGCA GGAGGGGCTC GCTTCGGTAA GTTTATTGAC GGGATTAAAG GGCCGTTTTC AAACGCTGGG TTCACATCCC AGGGTTATTG CAGATACTGC CCATAATCAA CCTGGTTTGG AAGCCCTCTT CGATACGATA CGATCTATAC CTTACAAAAC GCTTCGTATT ATTATTGGCC TTGTGGCAGA TAAAGATCGT AGTAAGGTCC TATCTGTATT ACCCACAAAT GCCGTTTATT ATTTTTGTCA GGCGAATACT CCCCGCTCAT TATCGGCTCA GTTATTACAA CAGGAAGCGC GTGTTCTTGG CCGTATAGGG GATGTATTTA CTGATGTAAA TACTGCTTTA GCGGCAGCCC TAGAGCAGGC TGACCCTGAT GATTTACTGC TCATAACCGG CAGTAATTAT ACCATTGCTG AATTAACCAA TTTATAA
|
Protein sequence | MQYTEAIDYL YSRLPVFHRI GPKAIKPGLG NTLLLCEGLG NPHQQFTSIH VAGTNGKGST SHMLAAIYQS AGYRVGLYTS PHLKSFTERI RLNGRPIPEE EVVRFVEQQQ PLIESVEPSF FEVTVAMAFY FFARHAVDIA IIEVGLGGRL DSTNVITPIA SVITNIGYDH TDILGDTLPL IAAEKAGIIK PGVPVIIGES HPETQEVFTS VSASLQAPIT FADRQYLVND LGLVDGIRQA SISRGDGSGW LAQLDLLGAY QLKNLPGVFA TVEQLQQQFP VTAAQQQEGL ASVSLLTGLK GRFQTLGSHP RVIADTAHNQ PGLEALFDTI RSIPYKTLRI IIGLVADKDR SKVLSVLPTN AVYYFCQANT PRSLSAQLLQ QEARVLGRIG DVFTDVNTAL AAALEQADPD DLLLITGSNY TIAELTNL
|
| |