Gene Slin_4720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4720 
Symbol 
ID8728484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5745606 
End bp5747573 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF1680 
Protein accessionYP_003389497 
Protein GI284039567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.827878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA GAATAGCTTT CGGCTTGTTA ATTAGTCTGT TAGCGACCAT CTCCTACGCG 
CAGGATTACC CCATTGTTCC GGTTCCGTTC ACGAGTGTTA CCATCAACGG CGGTTTCTGG
CAACCCCGAC TCGTCACGAA TCGCACCGTC ACGCTTCCGT TCGATTTTAA GAAATGCGAA
GAGACCGGCC GCATCGCTAA TTTCGCCATA GCGGGTGGGC TGGCGAAAGG AACATTCAAA
GGAAAACGGT ACGACGATTC CGATGTGTTC AAGGTCGTTG AAGGCGCTTC GTATTCCCTG
CAAACGCACT ATGACGCCAA ACTGGATCGC TATCTGGATA GCCTCATTAC CCTTTTCGCT
GCGGCCCAGG AGCCGGACGG CTATCTCTAC ACCATCCGGA CTATCCTGAA AGACAGCGCC
AACATCAAAG ACGATCAGGC CGGGCCAACG CGCTTTTCGT ACGTGGCGGG TAGTCATGAA
CTTTACAACG TCGGGCATAT GTACGAAGCC GCCGTGGCTC ATTTTCAGGC TACCGGCAAA
CGCTCGTTCC TGAACATAGC CCTCAAAAAC GCCGATTACC TGATGCGGAC CATCGGTCCC
GATAAGCTGA TTGTGGTACC CGGGCATCAG GAAATAGAAA TCGGTCTGGT CAAACTCTAC
CGTGTTACTA ACGACAAACG ATACCTCGAT TTCGCCCGTT TTCTGCTCGA TATGCGCGGC
CGTGCCGATA AACGTCCGTT GTTCCCAGAT CCCGCCAAAA CCGGTCAGGG GGCCAGTTAT
TTGCAGGACC ACCTACCTGT AACGCAGCAG AAAACCGCCG TTGGTCACTC GGTCCGGGCG
GGCTACATGT ATGCGGCTAT GAGCGACATT GCCGCTATCC AGAAAGACAA AGCTTACATG
GATGCGCTGC TGGCGATCTG GAACGATGTG GTTGAACGGA AACAGTACCT TACCGGGGGG
CTGGGTGCCC GTGGACATGG CGAAGCCTTC GGGGAGGCCT ATGAACTGCC CAACGATGTC
GCTTATGCCG AAACCTGTGC GGCCGTGGCC AATATGCTCT GGAACCACCG GATGTTTCTG
CTCACCGGCG AATCGAAATA CATGGATGTG TTTGAGCGGG TGCTGTACAA CGGTTTTCTG
GCCGGTGTTT CGCTCGAAGG CGACTCGTTC TTCTACGTCA ATCCGCTGGC TTCGGACGGC
AAGCGGAAGT TCAACGTAGG ACAGGCGGCT ACGCGGGCAC CCTGGTTCGG AACGTCCTGC
TGCCCTACCA ATGTGGTTCG GTTTCTGCCT TCGCTGCCCG GTTATGTGTA CGCCACCAAG
GGCGATAACC TGTTCATCAA CCTGTTCCTG ACTAACCAGT CGAAACTGTC GGTCAATGGG
AAATCAGTGC AGATCAGGCA GGAAACAAAT TATCCATGGG ATGGAAATGT GGCGATAACG
GTGCAGCCCA AACTGGCGCA GACCTTTACC ATTCAGTTGC GCCTGCCCGG TTGGGCGTCG
GGTACACCCA TGCCGGGATA CTTGTACGAG TACGTAAATA CGACGGCTAA AACACCTGTT
TTGCTGGTGA ACGGAAAACC GGTGCCGTAT AAAATAGAGA ATGGCTACGC CCGCATTAGC
CGAACCTGGA AACCCGGCGA CCGGCTGGAG TGGACGCTCG ATATGCCCGT TCGGGAAGTG
AAAGCCAACG AACAGGTGAC CGACGACCGC AAGAAGGTGG CTATTGAGCG AGGGCCGTTG
GTATACTGCG CTGAAGGTGT CGACAATGGT GGACAGGCGC TGTCCCTGGC GGTGCCTGCC
GGAACGACGT TCCGGCCACT GATGCAGCCC GACAAATTAG GCGGTATTCT GTCGCTGTCG
GGGCAGGAAG CCGGTAAAAG CGTGACGCTG ATTCCGTACT ACGCCTGGTC GCACCGGGGG
CCAAACGAAA TGGCGGTCTG GTTTGGGCAA ATGAAACCGG ACAGGTAA
 
Protein sequence
MTKRIAFGLL ISLLATISYA QDYPIVPVPF TSVTINGGFW QPRLVTNRTV TLPFDFKKCE 
ETGRIANFAI AGGLAKGTFK GKRYDDSDVF KVVEGASYSL QTHYDAKLDR YLDSLITLFA
AAQEPDGYLY TIRTILKDSA NIKDDQAGPT RFSYVAGSHE LYNVGHMYEA AVAHFQATGK
RSFLNIALKN ADYLMRTIGP DKLIVVPGHQ EIEIGLVKLY RVTNDKRYLD FARFLLDMRG
RADKRPLFPD PAKTGQGASY LQDHLPVTQQ KTAVGHSVRA GYMYAAMSDI AAIQKDKAYM
DALLAIWNDV VERKQYLTGG LGARGHGEAF GEAYELPNDV AYAETCAAVA NMLWNHRMFL
LTGESKYMDV FERVLYNGFL AGVSLEGDSF FYVNPLASDG KRKFNVGQAA TRAPWFGTSC
CPTNVVRFLP SLPGYVYATK GDNLFINLFL TNQSKLSVNG KSVQIRQETN YPWDGNVAIT
VQPKLAQTFT IQLRLPGWAS GTPMPGYLYE YVNTTAKTPV LLVNGKPVPY KIENGYARIS
RTWKPGDRLE WTLDMPVREV KANEQVTDDR KKVAIERGPL VYCAEGVDNG GQALSLAVPA
GTTFRPLMQP DKLGGILSLS GQEAGKSVTL IPYYAWSHRG PNEMAVWFGQ MKPDR