Gene Slin_2233 details

Gene Information

Locus tag: Slin_2233
Symbol:
ID: 8725973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2704035
End bp: 2705174
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: L-asparaginase, type I
Protein accession: YP_003387053
Protein GI: 284037123
COG category:
COG ID:
TIGRFAM ID:
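
The coordinate and length fields above are consistent with each other, and a short sketch can make the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 2704035
END_BP = 2705174


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1140 379"
```

Both results match the Gene Length and Protein Length fields of the record.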


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.335381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.30257
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTATA AAACCGTTCA CATAAGCCCT GTTTCACCCC GGCAGTCGCG GGCGTCGGTG 
CTGGTCATTT ATACCGGCGG CACCTTCGGA ATGATTTACG ACCCCAAAGC CAACGGACTG
ATTCCCTTTG ATTTTGACCG GGTACTGGAT CGCGTGCCTG AGTTGGATCG GCTCGATTTC
GATATTACGA TTCTTACTCT GCATGAAGTC ATTGATTCCT CGAACATGAA ACCTGCCATC
TGGGTCGAAT TGGCAAAGAT CATTCAGGAT ACCTACGACG AGTACGATAG TTTTGTCATT
CTGCACGGCA CCGACACCAT GTCCTACACG GCATCGGCCT TAAGCTTTAT GCTGGTTGGG
CTGAACAAGC CGGTTATTCT GACCGGGGCT CAACTACCAA TCGGTGTGGC CCGCAGCGAC
GCCCGCGAAA ACTTCATCAC CGCCCTCGAA ATTGCTGCCG CAGTGGATAC TGCCGCAGTG
GATACTGCCG CAGTGGACTC GGCCGCCGTG GATGGGGGTG GGGGACTGGC CCCTGCCAAG
GGGATTCCCC TTGTTCCGGA AGTGTGTTTG TATTTCAATT CGTTACTGCT GCGGGGCAAC
CGGTCTACCA AGCAGGAAAG TGTTCAGTTT AACGCCTTTA TCTCCGAAAA TTATCCACAT
CTGGCCACGG CAGGAGTAAG CATTGATTAT AACCGATTGT TTATTCGACC TTATCAGCCC
GGTCAGCAAT TGAGCCTGCG CACCACGCTC GATCCGAACA TAACCATTCT TAAACTCTTT
CCGGGGATTA CACAGCCGGT AGTTGAGTCG ATTGTGAACA TCCCGGATCT GCGGGCGGTG
GTTCTGGAAA CGTTTGGGGC TGGTAACGCT CCGACTGATA GCTGGTTTCT GGATACGATC
AAACGGGCTA TTGACCGGGG AGTGGTATTT TTTAACGTGT CGCAGTGCGA AGGCGGGCGT
GTAACCCAGG GTCAGTACCA GACCAGTAAA CAGCTGCTGC AAATTGGTGT AGTAAGTGGG
ACCGATATAA CAACCGAAGC CGCCGTTACC AAACTGATGG TTTTGCTCGG TCAAGAACAT
GACCCGGCGA AACTGCGCGT GCTTCTGACT CAGTCGATTA GTGGCGAAAT GAGTGAATAA
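
The GC content field of the record can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence; the function name `gc_content` is hypothetical, and the example call uses only the first ten bases of the gene as an illustration.

```python
def gc_content(seq):
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence, used as a small example.
print(gc_content("ATGCCTTATA"))  # prints 30.0
```

Passing the whole gene sequence, spaces and line breaks included, gives the value for the full CDS.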
 
Protein sequence
MPYKTVHISP VSPRQSRASV LVIYTGGTFG MIYDPKANGL IPFDFDRVLD RVPELDRLDF 
DITILTLHEV IDSSNMKPAI WVELAKIIQD TYDEYDSFVI LHGTDTMSYT ASALSFMLVG
LNKPVILTGA QLPIGVARSD ARENFITALE IAAAVDTAAV DTAAVDSAAV DGGGGLAPAK
GIPLVPEVCL YFNSLLLRGN RSTKQESVQF NAFISENYPH LATAGVSIDY NRLFIRPYQP
GQQLSLRTTL DPNITILKLF PGITQPVVES IVNIPDLRAV VLETFGAGNA PTDSWFLDTI
KRAIDRGVVF FNVSQCEGGR VTQGQYQTSK QLLQIGVVSG TDITTEAAVT KLMVLLGQEH
DPAKLRVLLT QSISGEMSE
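
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes translation table 11 (as listed in the record), whose codon assignments are the same as the standard code; it does not handle the alternative start codons that table 11 permits, which is not needed here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCCTTATA AAACCGTTCA CATAAGCCCT"))  # prints MPYKTVHISP
```

The output matches the first ten residues of the protein sequence listed above.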