Gene Slin_5082 details


Gene Information

Locus tag: Slin_5082
Symbol:
ID: 8728847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6213907
End bp: 6216285
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: Zn-dependent aminopeptidase
Protein accession: YP_003389856
Protein GI: 284039926
COG category:
COG ID:
TIGRFAM ID:
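The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names `span_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


gene_len = span_length(6213907, 6216285)
print(gene_len)                  # 2379
print(protein_length(gene_len))  # 792
```

Both values agree with the Gene Length and Protein Length fields of this record.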


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.546478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGA TAGTTTTGCT GTTGGCGAGT TTGGCCATCC CGTTCGGGCT GGTTCAGGCG 
CAAGCCCCAA CACAGCATGC TAACACGCGC TTTGAGCAGT TAGGGCCTTT GCTTCCTACG
CCCAATACGT TCAGAACAGC CTCCGGTGCG CCGGGAAAAG ACTACTTCCA GAATCGAGCT
GACTACGACA TTAAAGCAAC GCTCGATGAT ACCAAACAGC ATATAACCGG TTCGGAAACC
ATCACTTACC ATAACAACTC GGGCGATGCG CTTCCGTATC TGTGGCTTCA ACTGGACCAG
AACCTGTTCC GGCCCGATGC CAACGGTAAT ACCACCGAAA CCAACAGTAT CAACGCTCAG
CGGGGTATGA CCGCAGAGCA GCTCGACCCC AGCTCAGTAT TGAAAGGCAA AGATTACGGG
CACAAGATCA CGGCGGTGCG CGATCAGGCG GGAAAGGCCC TCAAGTACAC CATCAACCAG
ACCATGATGC GCATCGACCT GCCCCAGCCT GTTGGGCCCG GCAAGTCGGT TGTATTTTCT
ATTGACTGGA ACTTCAATAT TGTTGATGCG AAAGCTACCC ACGCCCGGTC AGGGTACGAG
TATTTCCCAA AGGACGGTAA CTACGTCTAT GAAATTGCTC AATGGTTCCC CCGCCTGTGC
GCTTACAGTG ATGTGACAGG CTGGCAGAAC AAACAGTTTC TGGGACAGGG GGAGTTCACC
CTCATCTTTG GCAACTACAA GCTGGCCCTT ACCGTCCCCA ACGACCACAT CGTGGGCGCT
ACGGGCGAGT TGCAGAACCC GGCCCAGGTG CTGACCCCTA TCCAGATCAA ACGCTGGAAC
GAAGCCAAGG GCAAGGGTGA CAAGCCCGGC GAAAACCCCA CCGTCATCGT GACACAAGCC
GAAGCTGAAG CGGCCGAAAA AGGTAAGCCG ACAGGCACCA AGACATGGGT ATTCAAGGCC
GATAACGTCC GCGATTTCGC GTTTTCCAGC AGTCGTAAGT TCATCTGGGA TGCGCTGAAC
CCCAACGTAG AAGGCAAGCG GGTATGGGCC ATGTCACTTT ACCCTAAAGA AGCGAATCCA
CTCTGGGGTC AATACTCGAC CCGGCTGGTA GCGCACACAT TACGTTCGTA CTCTCGCCGG
ACCATCGCTT ACCCCTATCC GGTGGCCTAC TCGGTTCACG GACCCGTGGG CGGCATGGAG
TACCCCATGA TGTGCTTCAA CGGTGCCCGC CCCGAAGAAG ACGGAACGTA TTCGGAAGGC
ACTAAAAACT TTTTGATTTT GGTCGTTATC CACGAAGTGG GTCACAACTT TTTTCCCATG
ATCGTCAACT CCGACGAGCG GCAGTGGTCC TGGATGGACG AAGGACTCAA CAGCTTCCTT
GAAGGGGTAA CCTGTTTAGA GTGGGACGCC AACTTCCCGG CGCGGGGCAT TCAACCCCAA
TCGATCGTGT CTTACATGCG ACTTGATTCG ACCCAGCAGG TGCCCATCAT GAGCAGTTCG
GATAATATTT TACCGAATAC CTTCGGGCCA AATGCTTATG ACAAACCCGC TACGGCGCTG
AACATCCTGC GCGAGACGGT CATGGGCCGC AAGCTGTTCG ACTACGCCTT CAAAGAGTAC
GCCCGTCGAT GGGCCTTCAA GTCCCCCGAA CCCGCCGACT TCTTCCGAAC CATGGAAGAT
GCCTCCGGCG TTGACCTCGA CTGGTTCTGG AAAGGCTGGT TCTACGGCGT GCAGCCCGTC
GACCAGGCAC TGGTCAAAGT CGACTGGTTC CAGGCCGGGT CACAAAACCC CGAGATCGCC
AAGGCCGAAG CCCGGGCAGC CGCAGCCAAA CGTGCCAACA CCATCAGCAA GCAACGCGAT
GCCGCCACCA AAGGCCAGAC CGTCGTCGCC CAGGACTCCA CCATGAAGGA CTTCTACAAC
AGCTACGATC CCTACGCCGT TACCGAGGAA GACAAGAAGA AGTACCAGGA CTACCTGGCC
ACACTCACCC CTGAGGAGCG CAAACTGGCC GAAGCAGGCA CCAACTTCTA CACCCTCTCG
CTCAAGAACA AGGGCGGCAT CCCCATGCCG GTGATCGTGC GCATGGAGTT CGAGGATGGC
ACCGACTCGG TGGCGCGCTT CCCGGCCGAG ATCTGGCGCT TCAACGACGT GTCGATCAAC
AAGGTGATTG CCACCAGCAA GAAAGTGAAG CAGTGGACGC TGGACCCTTA CTACGAGATT
GCCGACATCA ACACGGAGGA CAACAGCTTC CCGCCGGTGG CTCAGCCGAC GCGGTTCCAG
TTGTTCAAAC AGCAGCAGCG GGGCGGTGGG GCCGCGCCTA ACCCCATGCA GCAGCAACGT
CAGCAACAAC CCGCCAAACA AGGCACCGGT CGAAATTAA
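The gene sequence above is laid out in space-separated blocks of ten bases. As a rough sketch of how such a listing could be cleaned up and its GC content recomputed for comparison with the GC content field in the Gene Information section, the Python below strips the whitespace and counts G and C bases. The names `clean_sequence` and `gc_percent` are hypothetical helpers, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(blocked):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative inputs (not taken from the record):
print(clean_sequence("ATGC AAAA\nGGCC "))  # ATGCAAAAGGCC
print(gc_percent("GGCC AATT"))             # 50.0
```

Passing the full listing above to `gc_percent` gives a value that can be checked against the reported GC content.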
 
Protein sequence
MQKIVLLLAS LAIPFGLVQA QAPTQHANTR FEQLGPLLPT PNTFRTASGA PGKDYFQNRA 
DYDIKATLDD TKQHITGSET ITYHNNSGDA LPYLWLQLDQ NLFRPDANGN TTETNSINAQ
RGMTAEQLDP SSVLKGKDYG HKITAVRDQA GKALKYTINQ TMMRIDLPQP VGPGKSVVFS
IDWNFNIVDA KATHARSGYE YFPKDGNYVY EIAQWFPRLC AYSDVTGWQN KQFLGQGEFT
LIFGNYKLAL TVPNDHIVGA TGELQNPAQV LTPIQIKRWN EAKGKGDKPG ENPTVIVTQA
EAEAAEKGKP TGTKTWVFKA DNVRDFAFSS SRKFIWDALN PNVEGKRVWA MSLYPKEANP
LWGQYSTRLV AHTLRSYSRR TIAYPYPVAY SVHGPVGGME YPMMCFNGAR PEEDGTYSEG
TKNFLILVVI HEVGHNFFPM IVNSDERQWS WMDEGLNSFL EGVTCLEWDA NFPARGIQPQ
SIVSYMRLDS TQQVPIMSSS DNILPNTFGP NAYDKPATAL NILRETVMGR KLFDYAFKEY
ARRWAFKSPE PADFFRTMED ASGVDLDWFW KGWFYGVQPV DQALVKVDWF QAGSQNPEIA
KAEARAAAAK RANTISKQRD AATKGQTVVA QDSTMKDFYN SYDPYAVTEE DKKKYQDYLA
TLTPEERKLA EAGTNFYTLS LKNKGGIPMP VIVRMEFEDG TDSVARFPAE IWRFNDVSIN
KVIATSKKVK QWTLDPYYEI ADINTEDNSF PPVAQPTRFQ LFKQQQRGGG AAPNPMQQQR
QQQPAKQGTG RN
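The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. Table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so a minimal sketch can use the standard codon assignments. The function `translate` below is a hypothetical helper written for illustration; it is not the tool the database used.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (such as the ten-base block spacing used above) is ignored.
    Any trailing bases that do not form a full codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record:
print(translate("ATGCAAAAGA TAGTTTTGCT GTTGGCGAGT"))  # MQKIVLLLAS
print(translate("CGAAATTAA"))                         # RN
```

The two outputs match the first ten and last two residues of the protein sequence listed above.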