Gene Slin_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3972 
Symbol 
ID8727730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4770633 
End bp4772420 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content56% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003388761 
Protein GI284038831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000648751 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTAT CTCACCCGAC CGGGACGGTT CGACGCATCG ACGGTCGGCA ATACTTACCA 
TACATGAATC GACGTAATTT TATTGAACAA CTATCCGTAA GCAGTGGAGC CGCGTTAACC
GCTCCACTAC TTACCTTTTC GGGAGAATCA CACGCCCAGG CCAGTGGGCT GCTGAACCGG
CTCGGGCAGG CCGAACGAGC CGCCGATGTC GTCATTGCCG GTGGTGGGCT GGGGGGCTGT
GCGGCTGCTT TAGCCGCTTT GCGAAACCGC CTGACGGTCA TTATGACCGA AGAAACCGAC
TGGATTGGCG GGCAGATGAC CCAGCAGGGC GTGCCACCCG ATGAACACCA ATGGATAGAA
ACCCACGGGG CTACCCAGCT TTACCGCGAT CTGCGCACGG GTATTCGGGA TTATTACAAG
CAGCATTATC CGCTGACGGA TGCTGCCAAA GCGAGCAAAT TCCTGAATCC AGGCGATGGA
GCCGTATCGC GCCTGTGTCA TGAGCCGAAG GTTGCCCTGG CGGTTTTACA GCAGATGATG
GCGCCGTACC AAAGTTCCGG CCAGTTGACG TTGCTGCTCG AACACAAGAT CACCTCGGCC
GATGTTCAGG GCGATAAAGT TCGTGCGCTG AAAGCCATCA GCCAGCGAAC GGGGAAAGAA
ACGGTGCTAA CGGCCCCCTA TTTCGTCGAT GCCACCGAAC TCGGCGACCT GTTGCCCCTC
ACCGGTACCG AGTTCGTTAC GGGTGCCGAA GCCCGTTCCG AAACGCGCGA ACTCCACGCC
CCCGACAAAG CCGACCCCAA CAACTGCCAG GCGTTTACGG TCTGCTTTGC GATGGATTAC
ATAGCGGGGG CGAATCACGT TATCGATAAG CCCAAAGACT ACGCTTTCTG GCGGAACTAC
TCGCCCAAGG TTACGCCCGC ATGGTCGGGG AAACTGCTCG ACCTGTCGTA TTCGAACCCG
AAAACACTGG AGCCCAAACA GCTTGGTTTC CACCCGGAAG GCATTGCCAC CGGCGATAAA
CTGAACCTCT GGAATTACCG TCGGGTGATC AGCAAGGCCA ATTTCAAACC CGGAACGTAT
GCTGGCGATG TGTCGGGCGT GAACTGGCCC CAGAACGATT ACAACGCCGG AAACCTGATT
GGAGCCAGCG AGAAAGATTT TAAGAAATAC GTTGAGCAGG CCAAGCAACT GAGTCTGTCG
TTGCTCTACT GGCTCCAGAC CGAAGCCCCC CGCCCCGACG GCGGACAGGG CTGGCCGGGC
ATCCGATTTC GGCCGGATGT GATGGGTAGC GAAGACGGGC TGGCCAAATA CCCATATGTT
CGGGAATCCC GCCGAATCAA AGCCGTCTTT ACCGTTCTCG AAGAGCATGT CGGTGCCGAA
AACCGGGCGA TGATCACGGG TAAAAAAGAG GGCAACACCT CAGCCGATTT TCCGGATAGT
GTGGGTGTGG GCTATTACCA CATTGATCTG CACCCCAGTA CCGGCGGCAA CAATTATATT
GACTTTAGCT CCATGCCGTT TCAGATTCCG CTGGGGGCTC TGCTGCCTAA ACGTATGGAA
AATCTGTTAC CCGCCAACAA AAACATCGGC ACTACGCACA TCACCAACGG CTGTTACCGG
CTGCACCCCG TCGAATGGAG CATTGGCGAA GCGGTAGGGA TGCTGGTCGC CTATTCCTTC
AACAAAAAAG TAATCCCCCG TGCAGTTCGG GAGAAGGAAC AACTTTTGGG TGATTTTCAG
AAGCTGATTC GTTCGCAGGG TATAGAAACG AATTGGCCTA AAGCGTAG
 
Protein sequence
MTLSHPTGTV RRIDGRQYLP YMNRRNFIEQ LSVSSGAALT APLLTFSGES HAQASGLLNR 
LGQAERAADV VIAGGGLGGC AAALAALRNR LTVIMTEETD WIGGQMTQQG VPPDEHQWIE
THGATQLYRD LRTGIRDYYK QHYPLTDAAK ASKFLNPGDG AVSRLCHEPK VALAVLQQMM
APYQSSGQLT LLLEHKITSA DVQGDKVRAL KAISQRTGKE TVLTAPYFVD ATELGDLLPL
TGTEFVTGAE ARSETRELHA PDKADPNNCQ AFTVCFAMDY IAGANHVIDK PKDYAFWRNY
SPKVTPAWSG KLLDLSYSNP KTLEPKQLGF HPEGIATGDK LNLWNYRRVI SKANFKPGTY
AGDVSGVNWP QNDYNAGNLI GASEKDFKKY VEQAKQLSLS LLYWLQTEAP RPDGGQGWPG
IRFRPDVMGS EDGLAKYPYV RESRRIKAVF TVLEEHVGAE NRAMITGKKE GNTSADFPDS
VGVGYYHIDL HPSTGGNNYI DFSSMPFQIP LGALLPKRME NLLPANKNIG TTHITNGCYR
LHPVEWSIGE AVGMLVAYSF NKKVIPRAVR EKEQLLGDFQ KLIRSQGIET NWPKA