Gene Slin_1756 details

Gene Information

Locus tag: Slin_1756
Symbol:
ID: 8725493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2113521
End bp: 2114837
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: aminotransferase class V
Protein accession: YP_003386600
Protein GI: 284036670
COG category:
COG ID:
TIGRFAM ID:
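
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, not stated in the record). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2113521
END_BP = 2114837


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Under these assumptions the computed values agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields.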


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.60376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.594036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACCC GCAGGACTTT CTTCCGCCAA ACCGCAGCCG CAGCGACCGG TGCACTGACG 
CTTCCACAGT TTTTACCCGA AACGTTCGCC AACCCAATTG CCACGGCCGA TGTCGGGCTA
AAGTCGGCCG CTCAGTGGGC GCAGGATGAA GATTTCTGGT CCTGGATAAA GTCAGAGTAC
ACCGTATCGC CTAATCTGCT TAACCTCAAC AACGGGGGCG TTTGTCCGCA GCCGAAAGTA
GTACAGGACG CGCACATCCG GTTTTACCAG TACTGCAACG AAGCGCCGTC GTACTACATG
TGGCGAATTC TGGATCAGGG CCGGGAAGCC CTGCGGAGCA AGCTGGCCGA TTTGGGCGGC
TGTTCGGCGG AGGAAATCGC CATCAACCGG AACGCTACCG AAGGACTAAA TACGGTCATC
TTCGGCCTGA ATCTGAAAGC GGGGGATGAA GTCGTACTCA CGAAGCAGGA TTACCCGAAT
ATGCTCAATG CCTGGAAACA GCGCGAGAAA CGGGACGGCA TTAAGCTGGT TTACCTGGAC
CTGAACCTGC CCAGTGAGGA CGACGATGCC CTGGCTGAGC AATACATCCG GGCGTTTACA
CCCCGTACTA AAGTGGTGCA TGTTACGCAC ATGATTAACT GGATTGGGCA GGTGATGCCC
GTTCGAAAGA TTGCCGATGC CGCCCACAAA CGCGGTATCG AAGTGATTGC CGATGGCGCT
CACTCCTTCG CCTTGTTCGA TTTCAAGATT CCCGACTTGG GGTGCGACTA TTTTGCGAGT
AGCCTGCATA AGTGGCTATC TGCCCCTTTC GGCAGCGGGA TGCTGTATAT CAGGCAGAAT
AAGATTAAAA ACGTTTGGGC TTTATTGTCG AACAATGAGC CCGATGGCCC TGATATTCGG
AAATTCGAAA GCCTGGGAAC GCGTTCTTTT GCCTCCGAAA TGGCCATCGG AACGGCGGTT
GATTTTCATA ATAGTATCGG CTCAGCGCGC AAATTTGCCC GTGCGCACTA CCTCAAAAAC
TACTGGATGG AGCGGGTGAA AGATATTCCC GGTGTTAAGG TTCATACGTC TTTCAAGCCT
GAATTTGCGG GAGCTGTAGC GCTGTTTTCC ATCGATGGCA TGAAACCCTC CGAAATAGAC
GGCCAATTGC TCAATCAGTA CAAAATACAT ACGGTGGGGA TTGAGTGGGA AAATATTAAA
GGTGTACGGG TAACCCCCCA CGTCTACCAT TCGCCCAAAG ATCTGGATCG GCTGGTGGCT
GCGATCACAG CCATAGCTGA CAAACAGATT GCCATTGGTA AGCAAAAGAA GAGCTAG
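
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full gene sequence. How the record itself rounds the value is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Paste the full gene sequence here; only its first line is shown for brevity.
fragment = "ATGGCAACCC GCAGGACTTT CTTCCGCCAA ACCGCAGCCG CAGCGACCGG TGCACTGACG"
print(f"{gc_content(fragment):.1%}")
```

Because the sequence is printed in space-separated blocks of ten, the helper strips all whitespace before counting.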
 
Protein sequence
MATRRTFFRQ TAAAATGALT LPQFLPETFA NPIATADVGL KSAAQWAQDE DFWSWIKSEY 
TVSPNLLNLN NGGVCPQPKV VQDAHIRFYQ YCNEAPSYYM WRILDQGREA LRSKLADLGG
CSAEEIAINR NATEGLNTVI FGLNLKAGDE VVLTKQDYPN MLNAWKQREK RDGIKLVYLD
LNLPSEDDDA LAEQYIRAFT PRTKVVHVTH MINWIGQVMP VRKIADAAHK RGIEVIADGA
HSFALFDFKI PDLGCDYFAS SLHKWLSAPF GSGMLYIRQN KIKNVWALLS NNEPDGPDIR
KFESLGTRSF ASEMAIGTAV DFHNSIGSAR KFARAHYLKN YWMERVKDIP GVKVHTSFKP
EFAGAVALFS IDGMKPSEID GQLLNQYKIH TVGIEWENIK GVRVTPHVYH SPKDLDRLVA
AITAIADKQI AIGKQKKS
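
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). `translate` and `CODON_TABLE` are hypothetical names. Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGGCAACCC GCAGGACTTT"))  # MATRRT
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence should allow a comparison against all 438 residues.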