Gene Slin_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1967 
Symbol 
ID8725704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2378763 
End bp2380052 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content44% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003386811 
Protein GI284036881 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.211541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0364461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAG CCAGGTTGCA TTTGCTGTCC TCTCTCCAGA TAGAGCGCCT TCAAACCGTG 
GAAACAGATC AGATGCTGAA TGATGGAGGT GGACTTTATC TTTTCAATCG CCGTCACGGA
TCAAAAGAAT GGATTTTTCG ATACAGCTCA CCTCTTTCTG GGCAACGCCG AAAGCAGTCG
CTTGGAACAT ATCCCGATAC ATCGTTAAAA CAGGCACGTG CATTGGCCTC TTCAAAACGC
GATATAATTT CTCGTGGACT TGATCCAATT GTTGAGATGC AGAAGCATAG ACAAATCGAA
ATCACCAATG TCAATAAACG CGAAGAATAT GGCCGAAATA CGGTAAAAGT TGTTTTTCAA
GCCTGGAAGC GAGCAGAACT GCAAAACCGT AAAGATAAGG GAGATGAAAT CGAAAGGGCT
TTCGAAAAAG ACGTTTTCCC TGTGATTGGA AACAAGGTCA TAGGGGAAGT CACGCGCAAT
GATATAAAAT TTGTATTGGA GCGACCTCTC AAGCGACACG TCAATCGCAT GGCTAATCGT
CTGTTATCCG ATCTGAAACA ATTTTTTGGC TATGCTGAAG ACGAAGAACT CACTCATCAA
GATCCAACCA GACGTTTGAT AAAAGACCGT GTTGGAGGAA AGGAAAAATC CCGTCAACGT
TATCTCGATC AGAAAGAGCT TAAATTACTT GCAGGACTTC TACCTGTATC TGGATTGAAA
GCAGAATATC AACATCTGAT TTGGCTTTTG CTGGCCACTG GATGTCGTGT CAATGAGATT
CTTCGCGCCC GTTGGAGTCA TGTTGATTGG GAAAATCGAT TCTTTCATAT CCCATCAGAA
CTCTCAAAAA ATACCCTTAA GCATGTCGTG TTTCTATCCG ATTTTGCTCT CTGGCATTTA
GCCCGATTAA AGGCTACCCA GACCACGGAA TGGATCGTAC CCAATCGATC CGGTACAGGT
CCTATCACGC GTCAGGTGCT TACGAAACAG GTGACCGACA GACAACAAAA AACTTTAGGC
AATCAGCGCG TCAATAATCC TCAGGCATTA ATATTGCCCA ATGGACGTTG GGTCATTCAT
GATTTACGCC GGACAGCCGG AACACTCATG CAAGAAATCG GTATTCTGCC TTACATTATC
AAAAAATGTC TCAATCAGAA AACGGAAGAT AAGATAATTG AAACCTATCA ACGGGCAGCG
TTAACAGATC ACCAGCAAGA AGCATTTCAT AAACTCGGAA GCTATCTGGA TCAACTGACA
CAGACTACTT TAACCACCCG CTCTGTTTAA
 
Protein sequence
MPKARLHLLS SLQIERLQTV ETDQMLNDGG GLYLFNRRHG SKEWIFRYSS PLSGQRRKQS 
LGTYPDTSLK QARALASSKR DIISRGLDPI VEMQKHRQIE ITNVNKREEY GRNTVKVVFQ
AWKRAELQNR KDKGDEIERA FEKDVFPVIG NKVIGEVTRN DIKFVLERPL KRHVNRMANR
LLSDLKQFFG YAEDEELTHQ DPTRRLIKDR VGGKEKSRQR YLDQKELKLL AGLLPVSGLK
AEYQHLIWLL LATGCRVNEI LRARWSHVDW ENRFFHIPSE LSKNTLKHVV FLSDFALWHL
ARLKATQTTE WIVPNRSGTG PITRQVLTKQ VTDRQQKTLG NQRVNNPQAL ILPNGRWVIH
DLRRTAGTLM QEIGILPYII KKCLNQKTED KIIETYQRAA LTDHQQEAFH KLGSYLDQLT
QTTLTTRSV