Gene Slin_2104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2104 
Symbol 
ID8725842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2540287 
End bp2542032 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content59% 
IMG OID 
ProductRagB/SusD domain protein 
Protein accessionYP_003386938 
Protein GI284037008 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.475472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ACCTACCTGT CCTGATCGCC ACGAGCCTGT CCCTTACGGG CTGCGCTGAT 
CTGCTGAAAC CTGAGGTTGA AAATCTTCCG GGCAAGGAGG CCATGTATAA AGAGGTGACA
TTTGCCCAGG GGTTCCTGCT CAATGGGTAT ACCCGCCTGC CTGGCCTGTC ATTCAACGAT
GTGGCCACCG ACGATGCCGT CAGCAACGAT AAAAACAACG CTTTCCAGAA AGTCGCCACC
GGCCAGTGGA CGGCCAACTT TAACCCCGCC GACCAGTGGG TCAACAGTAT GGCGGCCATT
CAATACCTGA ATACCATGCT GGCGGAGGTC GATAAAGTAA CCTGGGCCAT CGACCCCAAA
GCCGCCCAGA TGTTCAGGGA CCGGATCAAG GGCGAAGCCT ACGGCCTGCG TGCGTTCTAC
ATGTTTCAAC TGCTGCGGGC CCACGGCGGC TGGTCGGGCG GGGGCGAGTT GCTGGGCGTT
CCGATCCTGA CCAAACCACA GGATACGAAC TCCGACTTCA ACCAGCCCCG CGCCACCTTC
GAAGCCTGTA TGCAGCAGTT CTACAGCGAC ATAAAGCAAG CCGAAACCCT ACTGCCACTC
GATTACGAAG ACGTATCAAC CGCTGGTCAG GTGCCATCTC AATATGCGGG TATCAGCCCG
GAAGAATACA CCCGGGTGTT CGGGCAGTAT TCCCGGCTCC GGCTGACGGC CCGCATTGCG
AAAGCCATCC GAGCCCAGGC AGCCCTTCTG GCGGCAAGTC CGGCTTACAA GACCGGCACG
ACCACCACAT GGGCCGATGC CGCCAATTAT GCCGGTGAAG TCCTGAAACT CATTGGCGGG
CCTGACCGAC TGGCGGCCAA TGGCGGAACC TGGTACGACA ACCGTACCGA GATCAACAGC
CTCGGCGCGG GTATCAACCC GCCGGAGATT CTGTGGCGGG GCGACGCGTC GAACAGCCGG
AATCTGGAAG AAGACAACTA CCCACCGTCG CTCTTCGGCC GGGGGCGGAT CAACCCCACG
CAGAACCTGG TCGATGCATT TCCAATGGCC AACGGATACC CCATTTCGGC ATCGGCGAGT
GAGTACAGAG CCCAGGCACC CTACACCAAC CGCGACCCTC GTCTGCAACG CTATGTAGTG
ACCAACGGCA GCACAATGGG GCCATCCAAC ACCGTTATCC GGACCGAAAC GGGCGCTGGT
AACGATGCCG TCAATCAGAT CGAAACATCG ACGCGTACGG GCTATTACCT GCGAAAACTG
CTCCGGGCTG ACGTAAACCT AAACCCCATT GGGGCCAATG AGCAGCGGCA TTACACGGCC
CGCATCCGGT ATACCGAGAT CTTCCTGATC TACGCCGAAG CGGCCAACGA GGCCTGGGGT
CCCGACGGAC AGGGGACGTT TGGTTTCTCG GCACGCGATG TGATCCGGGC GATCCGCAAA
CGGGCCGGCA TCGGTACAAC CAACAACGAC GCCTACCTGA CCTCGGTGCA GAGCCAGGCC
GACATGCGTC AGCTGATCCG GAATGAACGT CGGCTGGAGC TGTGCTTCGA AGGATTCCGG
TTCTGGGATC TGCGTCGCTG GAGCGAAAAA CTAACCGAAA CGGCCAACGG CATCCAGATC
ACGAACGGAA ACTATGCCGT TATCCCGGTC GAAAACAGAG CGTTTGCGGC CTACATGAAC
CACGGCCCGA TCCCATATGG CGAAATGCTC AAATGGAAGG CACTCGTTCA AAACAAAGGC
TGGTAA
 
Protein sequence
MKKYLPVLIA TSLSLTGCAD LLKPEVENLP GKEAMYKEVT FAQGFLLNGY TRLPGLSFND 
VATDDAVSND KNNAFQKVAT GQWTANFNPA DQWVNSMAAI QYLNTMLAEV DKVTWAIDPK
AAQMFRDRIK GEAYGLRAFY MFQLLRAHGG WSGGGELLGV PILTKPQDTN SDFNQPRATF
EACMQQFYSD IKQAETLLPL DYEDVSTAGQ VPSQYAGISP EEYTRVFGQY SRLRLTARIA
KAIRAQAALL AASPAYKTGT TTTWADAANY AGEVLKLIGG PDRLAANGGT WYDNRTEINS
LGAGINPPEI LWRGDASNSR NLEEDNYPPS LFGRGRINPT QNLVDAFPMA NGYPISASAS
EYRAQAPYTN RDPRLQRYVV TNGSTMGPSN TVIRTETGAG NDAVNQIETS TRTGYYLRKL
LRADVNLNPI GANEQRHYTA RIRYTEIFLI YAEAANEAWG PDGQGTFGFS ARDVIRAIRK
RAGIGTTNND AYLTSVQSQA DMRQLIRNER RLELCFEGFR FWDLRRWSEK LTETANGIQI
TNGNYAVIPV ENRAFAAYMN HGPIPYGEML KWKALVQNKG W