Gene Slin_6602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6602 
Symbol 
ID8730388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp8018760 
End bp8019905 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID 
ProductHeparan-alpha-glucosaminide N-acetyltransferase 
Protein accessionYP_003391358 
Protein GI284041428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAT TACAGACGCC TATTGAACCG GCTATCGCTC AGCCAACACC CGTTAAACGA 
CTGCTCTCCC TCGATACGCT GCGCGGCTTC GATATGTTCT GGATTATGGG CGGGGAAGAA
ATCTTTCACG TGCTGGCCAA AACGACCGGC TGGGCGGGGG CTATTCTATT AGCCGATCAG
TTTTCGCACC CCGCCTGGAA CGGGTTTCGG GCCTACGACC TCATCTTTCC GCTGTTCATG
TTCATGGCGG GCGTGTCCAC CCCTTTTTCG GTCGGGTCGC GGCTCGATCA GGGAACAGAC
AAAGCGAAGA TTGCCCGCAA GATCATCAGC CGGGGGCTCA TTCTGGTTGT GCTGGGCATC
ATTTACAACA ACGGGCTATT TAACCGGGTT TTCGAGGACA TGCGTTTCCC AAGTGTACTG
GGTCGCATTG GTCTTGCCGG GATGTTTGCC CAATTGATCT ACCTGTATTT CCGGCCCCGC
GCTCAGTACA TCTGGTTCGT GGGACTGTTG CTGGGCTACT GGGCGCTGAT GATGCTGGTG
CCGGTACCCG GATGTGGGGC GGGGGTACTC ACCATGGAAT GCAATCTGGC CAGCTTTATC
GACCGAATGC TGGTGCCGGG TCATTTGTAC AAAACGATTC ATGACCCGGA AGGCCTGTTT
TCGACACTCC CGGCCATCGA CAATACCTTG CTGGGTATTT TTGCCGGTAC ATTTCTGCGG
ACGCATGGCA GAACGGGCAA TCAGAAAACG GCGCTGCTGC TCGGTGCCGG AGCGGCTTTT
GTACTACTCG GCTGGCTTTG GGATTTTGTT TTCCCCATCA ACAAAAACCT CTGGACCAGT
TCGTTCGTGC TGGTTACGGG TGGATTGAGC CTGTTACTGC TGGCCGTATT CTACTGGGTT
ATTGACGTAA AAGGTATCAA ACGCTGGACG TTTTTCTTCA CGGTCATCGG CATGAATTCC
ATTCTGATTT ACCTGGCGGG CGAATTCATC GACTTTGAGT ACGCGGCCCG TTTTTTCTTC
GGCGGTCTGC TCAAACAGTC ATCTTCCGAA GTGGTCACTG CTGTTGGGGA GGTAATTGCC
TTTCTGGCCG TCAAATGGGC CTTTTTGTAC ATACTTTACA AGAAAAAAGT ATTCCTGCGG
GTGTAA
 
Protein sequence
MSTLQTPIEP AIAQPTPVKR LLSLDTLRGF DMFWIMGGEE IFHVLAKTTG WAGAILLADQ 
FSHPAWNGFR AYDLIFPLFM FMAGVSTPFS VGSRLDQGTD KAKIARKIIS RGLILVVLGI
IYNNGLFNRV FEDMRFPSVL GRIGLAGMFA QLIYLYFRPR AQYIWFVGLL LGYWALMMLV
PVPGCGAGVL TMECNLASFI DRMLVPGHLY KTIHDPEGLF STLPAIDNTL LGIFAGTFLR
THGRTGNQKT ALLLGAGAAF VLLGWLWDFV FPINKNLWTS SFVLVTGGLS LLLLAVFYWV
IDVKGIKRWT FFFTVIGMNS ILIYLAGEFI DFEYAARFFF GGLLKQSSSE VVTAVGEVIA
FLAVKWAFLY ILYKKKVFLR V