Gene Slin_3931 details


Gene Information

Locus tag: Slin_3931
Symbol:
ID: 8727689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4713922
End bp: 4715301
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: protein of unknown function DUF162
Protein accession: YP_003388720
Protein GI: 284038790
COG category:
COG ID:
TIGRFAM ID:
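The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4713922
end_bp = 4715301

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1380, matching "Gene Length"
print(protein_length)  # 459, matching "Protein Length"
```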


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.463483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC AGCTTATTTT AGATCACGCC GATGCCGCCG TTGAGTTCAA TAAGAACGAA 
GCGAAGGTCG ACTGGCACGA CGAAACGCTC TGGTTTGTAC GTACCAAACG CGACCGGGCC
GTTGCCCAGA TACCCGAATG GGAACAACTG CGGGAAGCAG CCTCGCAGAT AAAAAACCAT
GTGCTGTCCA ATATGCACGA TCTGCTGATC CAGTTTGAAG AAAACGCGCT TCGAAATGGC
ATCAAGATTC ACTGGGCTGC CAATGGCGAC GAGCACAACG CCATTATTCT TGACATTATA
AAACAGGCGG GGGCGAATCG GATGGTCAAA TCAAAGTCGA TGCTGACCGA AGAATGCCAC
CTGAACAAAT ACCTGATCGA TAACGGGATT GAGGTGATCG ACAGCGATTT GGGCGAACGG
ATTGTGCAGA TGCGCAACGA GCCGCCATCC CACATTGTGC TGCCCGCCAT TCACCTGACA
AAAGCCGAAG TTGGCGAAAC CTTCCACGAA CACCTGGGTA CCGAAAAAGG CGCTACCGAC
CCGCAATACC TGACCGAAGC CGCCCGGCAG CACCTGCGCG AAACGTTTCT GACGCGTAAA
GTGGCCCTTA CGGGTGTTAA CTTCGCCATT GCCGAAACAG GTGGTTTTGT GGTGTGTACT
AACGAAGGCA ATGCCGATAT GGGCGCGCAC CTGGCCGATG TACACATTGC GGCCATGGGT
TTCGAAAAGA TCATTCCCCG CGCCGAACAT TTAGGTGTTT TCCTGCGGCT GCTGGCTCGT
TCGGCCACCG GACAACCCAT TACGACGTTC TCCAGCCATT TTCACCGGCC ACGTCCGGGG
CAGGAGATGC ACATCGTTAT TGTTGATAAC GGACGCAGCC GTCAGCTAGG CCGACCCGAT
TTCCGAAATT CGCTCAAGTG CATCCGCTGC GCTGCCTGTT TAAATACCTG TCCCGTTTAC
CGGCGGTCGG GTGGGTTCAG CTACCACAGC GCCGTTGCCG GGCCAATTGG CTCTATTCTG
GCACCGAATC TGGACATGAA AAAGAACGCC GATTTGCCCT TTGCCTCAAC GCTTTGCGGG
TCGTGTTCCA ACGTGTGTCC GGTTAAGATC GATATTCACG ACCAGCTCTA CAAATGGCGT
CAGGTGCTGA TGCAGGAAGG ATATGGCCCC GGTAGTAAGA CCGTTGCCAT GAAGGCAATG
GCAACCGTAC TGGAGTCGCC CCGGTTATAC CGGATGGCGG GTAAAGTAGG GCGGGGGGTG
CTGCGATTTG CCCCCATTAC CGTCGAAAAC GGACTCAATC CCTGGTATAA CCAGCGCGAA
ATGCCCGAAC CACCTTCCGA AAGTTTCCAC GACTGGTACG TTAATCAGAA AAAATCATGA
 
Protein sequence
MNKQLILDHA DAAVEFNKNE AKVDWHDETL WFVRTKRDRA VAQIPEWEQL REAASQIKNH 
VLSNMHDLLI QFEENALRNG IKIHWAANGD EHNAIILDII KQAGANRMVK SKSMLTEECH
LNKYLIDNGI EVIDSDLGER IVQMRNEPPS HIVLPAIHLT KAEVGETFHE HLGTEKGATD
PQYLTEAARQ HLRETFLTRK VALTGVNFAI AETGGFVVCT NEGNADMGAH LADVHIAAMG
FEKIIPRAEH LGVFLRLLAR SATGQPITTF SSHFHRPRPG QEMHIVIVDN GRSRQLGRPD
FRNSLKCIRC AACLNTCPVY RRSGGFSYHS AVAGPIGSIL APNLDMKKNA DLPFASTLCG
SCSNVCPVKI DIHDQLYKWR QVLMQEGYGP GSKTVAMKAM ATVLESPRLY RMAGKVGRGV
LRFAPITVEN GLNPWYNQRE MPEPPSESFH DWYVNQKKS
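
The relationship between the two sequence blocks can be checked by translating the gene sequence codon by codon. The sketch below is not the tool used to produce this record; it is an illustration that assumes the sequence contains only A, C, G and T, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI "TCAG" order.
# Assumption: adequate for translation table 11, which differs from the
# standard table only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACAAGC AGCTTATTTT AGATCACGCC"
print(translate(first_blocks))  # MNKQLILDHA, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the GC content field.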