Gene Slin_4174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4174 
Symbol 
ID8727933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5024584 
End bp5026020 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content54% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionYP_003388959 
Protein GI284039029 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.427253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.28467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATAA AATCCCTTGA CCTTTTACGG CAGGAAGCCG CTCAAAGCGG AGAATCAACT 
CTCAAACGTT CGTTAGGTGC GTTCAATCTG GTTGCTATTG GCATTGGTGT TATTATTGGA
GCCGGGTTGT TTTCGCTGAC GGGGCTGGCC GCTGCCAACC ATGCCGGACC TGCGGTAACG
CTTTCATTCG TTGTAGCCGC CGTAGGCTGC GCGTTCAGTG CGCTGTGTTA TGCCGAGTTT
GCGGCTATGG TGCCGGTTGC AGGCAGTGCT TATACGTACG CCTATGCTAC CCTGGGTGAG
TTGTTCGCCT GGATTATTGG CTGGGACCTG GTACTGGAGT ATTCAGTTGG GGCCGCTACG
GTAGCCATTA GCTGGTCGCA GTACCTGACC CGGTTTCTGA GTTATTTTGG GCTGCACCTG
CCGCCCCGGT TCATGGCATC GCCGTTCGAA ACGGTTATGG GAGCCGATGG AACGGCCGTT
TCGGGTTTCG TCAACCTGCC CGCCGTTCTG ATAGTGGTGC TTATTACCGG CATTATTATT
CGGGGTACAT CGGGTTCTAC CTTGTTCAAC TCCATTGTGG TAACGCTGAA GGTACTGGTT
GTGCTGGTGT TCATTGCGCT CGGCTGGCAG TACATAGACC CCGTTAACTA TCAGCCTTAC
ATTCCGGAGA ATACCGGTAC GTTTGGCGAA TTTGGCTGGA GTGGCGTATT ACGTGGCGCG
GGGGTTGTCT TTTTTGTCTT TATCGGCTTC GATATTGTGG CGACTATGGC GCAGGAGGCC
CGCAACCCCC AGCGCGACAT GCCCATTGGC ATCCTTGGAT CGCTGGTGGT TTGTACGGTG
TTATTTGTGC TTTTCGGGTA CGTCATGACA GGTATGGCCA ACTACACGGA GTTTAAAAAC
AGTGCCGCTC CGGTAGCCAT CGCCATTGCC AAAACGCCCT ATCGCTGGCT CGGCCAGTTG
ATTATCGGGG CTATTCTGAT TGGGTATACC TCGGTCATTC TGGTCGATTT ACTGGGTCAG
TCGCGGGTGT TTTTCTCGAT GAGCCGCGAC GGTCTGTTGC CCAAAGCGTT CTCCGAAATT
CATCCCCGCT TCCGAACCCC CTGGAAGGCA AACCTGCTGT TATGTGCGTT TATCGGCCTG
TTTGCGGGCT TTGTACCCAT CCGGGTGGTT GGCGAAATGA CGAGTATCGG AACGTTGCTG
GCTTTTGTGA TGGTTTGTGT AGGTATACTG ATTCTCCGCA AAAAACAACC CGACGCACCC
CGGTCATTCC GGACGCCCTG GGTACCGGTG GTGCCTGTTC TGGGCATTCT TACCTGTATG
GTCATGATGG TCTCACTGCC GGGCGACACC TGGCTGAGGC TGGTGGTGTG GCTGGCTATC
GGGCTGGTGA TTTACTTTTT GTATGGCCGA AAGCGGAGTA AATTGCGGGA AAGCTGA
 
Protein sequence
MRIKSLDLLR QEAAQSGEST LKRSLGAFNL VAIGIGVIIG AGLFSLTGLA AANHAGPAVT 
LSFVVAAVGC AFSALCYAEF AAMVPVAGSA YTYAYATLGE LFAWIIGWDL VLEYSVGAAT
VAISWSQYLT RFLSYFGLHL PPRFMASPFE TVMGADGTAV SGFVNLPAVL IVVLITGIII
RGTSGSTLFN SIVVTLKVLV VLVFIALGWQ YIDPVNYQPY IPENTGTFGE FGWSGVLRGA
GVVFFVFIGF DIVATMAQEA RNPQRDMPIG ILGSLVVCTV LFVLFGYVMT GMANYTEFKN
SAAPVAIAIA KTPYRWLGQL IIGAILIGYT SVILVDLLGQ SRVFFSMSRD GLLPKAFSEI
HPRFRTPWKA NLLLCAFIGL FAGFVPIRVV GEMTSIGTLL AFVMVCVGIL ILRKKQPDAP
RSFRTPWVPV VPVLGILTCM VMMVSLPGDT WLRLVVWLAI GLVIYFLYGR KRSKLRES