Gene Slin_3041 details


Gene Information

Locus tag: Slin_3041
Symbol:
ID: 8726793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3694337
End bp: 3695569
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: ROK family protein
Protein accession: YP_003387851
Protein GI: 284037921
COG category:
COG ID:
TIGRFAM ID:
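
The coordinate and length fields above are related by simple arithmetic. The sketch below reproduces that relationship. It assumes 1-based, inclusive coordinates and a CDS that ends in a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3694337, 3695569)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both results agree with the Gene Length and Protein Length fields in the record.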


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0418823
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAG CTTTTACCAT TGATGAAGAA GCACCGATCT CTCAGTCGGT TGTTGATTAC 
AAAAAAAACC AGAAACAGCG GAAAGTACTT GCACATCTGT ATTCAGAAGG CACCTGTACG
CTGGCACATT TGACTAGGAT GCTGCACAGC AGCGTACCAT CTGTAACCTC ACTTGTTGAA
GAACTTATTG ATAACCAATG GGTAACCCCC ATCGGCACCG CTACTGGAAG TAATGGGCGT
CGCCCGGTTT TGTTTGGGTT AAACACAAAG AGCCATTACG TAGCCGTGCT GGATGTTAGT
ACTCACGATA CCAAAATCCT TTTCATGAAC GTTCAGCGAA AGGTGATCTT TCGGGGAGAC
TATGATCTGC GATTAAACGA CAATCCAAGC TTTCTCACTT CGCTGGTTCA CTACTTTCAC
AATGCCCTGG CCGATTCCGA GATTAGTATC GACGACCTCA TTGCTGTGGG TATTTCTATG
CCAGGTCTGG TAGACGCCCG CCGGGGACTG AATCTGACTT ACAAGAATTT ACACCAAACA
GGTGAATCAC TTCCTCAATG GCTTTCGGCC GAGCTTAATA AGCCCGTTTA CCTAATTAAC
GACACAAAGG CGACTGTACT GGGCGAGAGC CGGTTTGGTG GAGCACAGGG AAAAAAGCAG
GTACTGGCTA TCAATATCGA TTGGGGCGTT GGGTTAGGTA TCATTGTTAA TGGGGAAGTA
TTCCAGGGAG CCAGCGGATT TGCGGGCGAA CTGGGTCACA TTCAGGTCGA CCCGGATGGC
GAGTTGTGCT TTTGTGGCAA AATAGGCTGT CTGAGCACCA TAACGTCAGC CTCTGCACTG
GTAAAACGAG CGCAGACAGA TATTCTGGCT GGACAAGTTT CCAAACTGGC CACCTTTCGC
GATCATGTCG ATCAAATCGA TATTGACGAA GTAATTAATG CGGCTAACTC CGGCGACTCT
TACGCCATCG ACATTCTGCA CGAAACAGGC TATCAACTGG GTAAAGGACT CGCAATAGCC
ATCAGCCTGT TCAACCCGGA AATAATTGTT GTTGATGGTG TTCTTTTCAA AGCAGCCGCT
TTTATTCTGA ACACGATTGA GCAGGCAATC AGCAAATATT GCCTGAGTGA CTTTCGAAAC
GACATGACCA TCGAAGTGAC ACAGCTAAAC GGTACGGCCA AATGGTTGGG TACCCATGCT
TACATGATGG AGGATATTTT CGCCAATTAT TAA
 
Protein sequence
MNTAFTIDEE APISQSVVDY KKNQKQRKVL AHLYSEGTCT LAHLTRMLHS SVPSVTSLVE 
ELIDNQWVTP IGTATGSNGR RPVLFGLNTK SHYVAVLDVS THDTKILFMN VQRKVIFRGD
YDLRLNDNPS FLTSLVHYFH NALADSEISI DDLIAVGISM PGLVDARRGL NLTYKNLHQT
GESLPQWLSA ELNKPVYLIN DTKATVLGES RFGGAQGKKQ VLAINIDWGV GLGIIVNGEV
FQGASGFAGE LGHIQVDPDG ELCFCGKIGC LSTITSASAL VKRAQTDILA GQVSKLATFR
DHVDQIDIDE VINAANSGDS YAIDILHETG YQLGKGLAIA ISLFNPEIIV VDGVLFKAAA
FILNTIEQAI SKYCLSDFRN DMTIEVTQLN GTAKWLGTHA YMMEDIFANY
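
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the gene sequence, and compute its GC percentage. It is an illustration, not the database's own method. The helper names are hypothetical. It uses the standard amino acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not an issue here).

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Translation table 11 uses the same assignments; it differs in start codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAATACAG CTTTTACCAT TGATGAAGAA GCACCGATCT CTCAGTCGGT TGTTGATTAC"
print(translate(clean_sequence(first_line)))  # MNTAFTIDEEAPISQSVVDY
```

The translated first line matches the first twenty residues of the protein sequence above. Passing the full gene sequence through `clean_sequence` and `translate` allows the same check against the whole protein, and `gc_percent` can be compared with the GC content field.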