Gene Slin_6020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6020 
Symbol 
ID8729801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7301602 
End bp7302657 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content56% 
IMG OID 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_003390781 
Protein GI284040851 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0853695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.491658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTCC ACATCAGCAC GTACGGAACG TACCTGCACG TTAAAGATGC CATGTTCGAC 
GTTCGGCGGA AGGGCGAAGA TGGGAAGGTC ATCAGCGCGA CCTATTCGGC CGAGAAGGTG
ACGCATATTC TGCTGGCAAC GGGCACATCG CTCAGTACCG ATGCCGTGCG GCTGGCCATG
CGGCACAATG TCGACATCGT GTTTATCGAG CAACAGGGCG ACCCCATTGG GCGGGTATGG
CACGCCAAAC TGGGCAGCAC CACCAAAATC AGAAAGCGAC AGCTGGAGGC CAGTCTGGGG
CCGGATGGGC TGCGGTGGGT GCGGGCCTGG CTGCTGGCCA AACTCGACAA TCAGATGGGC
TTTATCCGAA GTCTGAAAAA ACACCGCCCC CAGCATGCCG GCTATCTGGA TGATAAACTA
GTTCGGATAG AAGCTATGGC TTTGTCGATC AGCACACTCG CGTCGGTTGG TGAGCAAACT
CCAGCGACTA CCTGCGTAGC CGATGTGGCC GATACCCTGC GCGGACTGGA GGGTACGGCG
GGTCGGTTGT ATTTCGAGAC GCTGAGCTAT GTATTGCCCA AAGAATATCA GTTTAGCGGG
CGCAGTAGTC GGCCAGCGCA GGATGCCTTC AACGCCTTTC TGAATTATGG CTACGGCATG
TTGTACGGAA AAGTGGAAAA AACGCTGATG ATGGCTGGCC TCGACCCGTA TGTGGGGTTT
CTGCATCGCG ACGATTACAA CCAGCTGAGC ATGGTGTATG ACTTCATTGA GCCGTATCGG
GGCTGGACCG ATGAAGTGGT GTTTCGGCTC TTTACGGCCA AAAAAGTCAA TAAAGCGCAC
ATAGGTGAGG TGTCGGGGTC GCGAACGGGG GTTTCGCTCA ATGCCGATGG CAAGGCATTG
CTGGTGAATG CGTTCAACGA GTGCATGGAT AATGACCCCA TTCGGTATCG AGGCCGCAAC
CTCGTCCGAA GCCATTGCAT GCAACTCGAC GCGCATCAGT TTGCGAATGA ACTCATCGGA
AAAACGGGTG GTCTGCCCGA CCTCGTCAAG CTATGA
 
Protein sequence
MQLHISTYGT YLHVKDAMFD VRRKGEDGKV ISATYSAEKV THILLATGTS LSTDAVRLAM 
RHNVDIVFIE QQGDPIGRVW HAKLGSTTKI RKRQLEASLG PDGLRWVRAW LLAKLDNQMG
FIRSLKKHRP QHAGYLDDKL VRIEAMALSI STLASVGEQT PATTCVADVA DTLRGLEGTA
GRLYFETLSY VLPKEYQFSG RSSRPAQDAF NAFLNYGYGM LYGKVEKTLM MAGLDPYVGF
LHRDDYNQLS MVYDFIEPYR GWTDEVVFRL FTAKKVNKAH IGEVSGSRTG VSLNADGKAL
LVNAFNECMD NDPIRYRGRN LVRSHCMQLD AHQFANELIG KTGGLPDLVK L