Gene Sked_21150 details


Gene Information

Locus tag: Sked_21150
Symbol:
ID: 8633750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sanguibacter keddieii DSM 10542
Kingdom: Bacteria
Replicon accession: NC_013521
Strand:
Start bp: 2360845
End bp: 2362131
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: carbohydrate-binding protein
Protein accession: YP_003314869
Protein GI: 269795414
COG category:
COG ID:
TIGRFAM ID:
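The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2360845
end_bp = 2362131

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A complete CDS spans a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1287 bp and protein length of 428 aa.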


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.396929
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.174783
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATCC GACGACGACT GCTCGCCCCC ACCGCGATCG CCGCAGCCGC AGCACTGCTG 
CTCGCCGGCT GCGGCCGCGA CACCGACTCC CCCGCACAGG CCGACGACGC CGTCGAGCTC
ACGAGCGGCC CTGCCTCCGG CACCGTGACC ATCTGGGCCC AGGGCACCGA GGGCGAGGCG
CTCACCGAGT TCATCAAGCC CTTCGAGGAG GCCAACCCCG ACGTCACCGT CAAGGTCACC
GCAGTCCCCT GGGACTCCGC GCAGAACAAG TACCAGACGG CCGTGGCGGG AGGGACGACC
CCCGACATCG GCATGCTCGG CAGCGACTGG ATGCCCACCT TCGCCAACGC CCTCCAGCCC
AAGCCCGAGG CGATCGACAC CTCCGGCATG TTCGACTTCG CGACGACGTC GACCACCTTC
GACGGCGTCG AGTACGCCGT CCCGTGGTAC GTCGAGACCC GCGTGCTGTT CTACCGCACC
GACCTCGCCG AGCAGGCCGG GTTCGACACC TTCCCCACCG ACTGGGACGG CTTCAAGGCC
CTCGCCCAGG CCTACCAGGA CGAGGCCGGC GCCGAGTACG GCGTCGCGCT CCCCTCCGGC
GGCTGGAACT CGTTCCTCAG CAGCCTCCCC TTCGCGTGGT CCAACGGGGC CGAGGTCATG
GACGCCGACC AGACCACCTG GACGCTCGAC ACCCCCGAGG TCACCGGCGC CGTCGACTAC
ATCGACAGCT TCTTCGCGGA CGGCATCGCC AACCGCAACC CTGACGCCGA GGCCGGCTCG
ACCACCGCGG CCTTCGTCGA CGGCTCCGTC CCGATGTTCA TGAGCGGGCC GTGGGACATC
CCCGGGCTCA AGACCGCCGG GGGCGAGGGC TTCGAGGACA AGTTCGGCGT CGGTCTCGTC
CCCGCGTCGC CGGACGGCAC CTCGACCTCC TTCGCCGCCG GCGCCAACCT CGCGGTCTTC
AAGGACGCCG AGAACCCCGA GGCCGCGTGG AAGCTCGTCG AGTGGCTCAG CCAGCCCGAG
GTCCAGGTCG ACTGGTTCAG CGCCGTCAAC GACCTCCCCG CCCAGGAGTC CGCCTGGGAC
GACCCGACGC TCACCGCAGA CCCCAAGGTC GCCGGCTTCG GCGAGCAGCT CAAGAGCGTC
AAGATCGCCC CGACCCTCAC CACCTGGCCC CAGGTCTCTG CCGCGGCAGA CACCCAGCTC
GAGCAGATCC TGCGCGGCGA CAAGGACCCG GCGGTCGCCC TCGGCGAGCT GCAGAGCACG
GCGGACTCCC TCGGGACCGG GCAGTGA
 
Protein sequence
MMIRRRLLAP TAIAAAAALL LAGCGRDTDS PAQADDAVEL TSGPASGTVT IWAQGTEGEA 
LTEFIKPFEE ANPDVTVKVT AVPWDSAQNK YQTAVAGGTT PDIGMLGSDW MPTFANALQP
KPEAIDTSGM FDFATTSTTF DGVEYAVPWY VETRVLFYRT DLAEQAGFDT FPTDWDGFKA
LAQAYQDEAG AEYGVALPSG GWNSFLSSLP FAWSNGAEVM DADQTTWTLD TPEVTGAVDY
IDSFFADGIA NRNPDAEAGS TTAAFVDGSV PMFMSGPWDI PGLKTAGGEG FEDKFGVGLV
PASPDGTSTS FAAGANLAVF KDAENPEAAW KLVEWLSQPE VQVDWFSAVN DLPAQESAWD
DPTLTADPKV AGFGEQLKSV KIAPTLTTWP QVSAAADTQL EQILRGDKDP AVALGELQST
ADSLGTGQ
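
The sequences above are printed in blocks of 10 letters separated by spaces and line breaks, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and it is an assumption that the record's GC content field was derived as the plain fraction of G and C bases.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between sequence blocks."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGATGATCC GACGACGACT GCTCGCCCCC ACCGCGATCG CCGCAGCCGC AGCACTGCTG"
dna = join_blocks(first_line)
print(len(dna), round(gc_fraction(dna), 3))
```

Pasting the full gene sequence into `join_blocks` in place of `first_line` should give a string whose length can be compared with the listed gene length, and a GC fraction that can be compared with the listed GC content.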