Gene Dshi_2197 details

Gene Information

Locus tag: Dshi_2197
Symbol:
ID: 5713850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2324302
End bp: 2325957
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 64%
IMG OID: 641268119
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_001533534
Protein GI: 159044740
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
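
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2324302
end_bp = 2325957
reported_gene_length = 1656     # bp
reported_protein_length = 551   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a stop codon,
# which is counted in the gene length but not in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 1656 bp and 551 aa.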


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCT ATCAGTACGT CTACCACATG GACGGCGTGT CCAAGACCTA TCCCGGCGGC 
AAGAAATGCT TCGAGAACAT CCGCCTCTCC TTCCTTCCGG GCGTCAAGAT CGGCGTCGTC
GGCGTCAACG GCGCGGGCAA GTCCACCCTA ATGAAGATCA TGGCCGGCCT CGACACCGAC
TTCACCGGGG AGGCTTGGGC CGCCGAAGGC GCCCGCGTCG GCTACCTGCC CCAGGAGCCC
GCCCTCGACG AGACCCTCAC CGTGCGCGAG AACGTCATGC TCGGCGTCGC CCCCAAGAAG
GCCATCCTCG ACCGCTACAA CGAGCTGGCG ATGAACTACT CCGACGAGAC CGCCGACGAG
ATGGCGAAGC TCCAGGACGA GATCGACGCG CAAAACCTCT GGGACCTCGA CGCCCAGATC
GACATCGCGA TGGAGGCGCT GCGCTGCCCC CCCGACGACG CCAGCCCCGC GAACCTCTCG
GGCGGGGAGC GCCGCCGCGT CGCACTCTGC AAGCTCCTGC TCGAAGCCCC CGACATGCTG
CTCTTGGACG AGCCCACCAA CCACCTCGAC GCCGAAACCA TCGCCTGGCT CCAGAAACAC
CTGATCGAGT ACAAGGGCAC CATCCTCATC GTCACCCACG ACCGCTACTT CCTCGACGAC
ATCACCGGCT GGATCCTGGA ACTCGACCGC GGCCGCGGCA TCCCCTACGA GGGCAACTAT
TCCGCCTGGC TCGACCAGAA GGCCAAGCGG CTCGAACGCG AGGCCAAGGA AGACAAGGCG
AAACAGAAAA CCCTCGCGCG CGAGCTCGAA TGGATCCGCG CCGGCGCCAA GGCCCGCCAG
GCCAAGCAGA AGGCCCGCAT CAACGCCTAC GAAGAACTCG CCGGCCAGTC GGAGCGCGAA
AAGGTCGGCA AGGCCCAGAT CATCATCCCC AACGGCCCCC GCCTCGGGAG CAAGGTGATC
GAGGTCGAAA ACCTCACCAA AGCTTATGGC GACAAGCTGC TGATCGAGAA CCTCTCCTTC
TCCCTGCCGC CCGGCGGCAT CGTCGGCGTG ATCGGCCCCA ACGGGGCGGG CAAATCCACA
CTCTTTCGCA TGCTGACAGG GCAGGAGCAG CCCGATGGCG GCACGCTCAG CTACGGCGAC
ACGGTGCAAC TGGCCTATGT CGACCAGTCC CGCGACACGC TCGACCCCGC CGCCACCGTC
TGGGAGGAGA TCTCCGGCGG CGGCGAAATC ATCGAGCTTG GCGACGCCCA GATCAACTCC
CGCGCCTATT GCGGCGCGTT CAACTTCAAG GGCGGCGACC AGCAGAAGAA GGTCGGGCTC
CTGTCGGGCG GCGAACGCAA CCGCGTCCAC ATGGCGAAAC TGCTGAAATC CGGCGGCAAT
GTCCTCCTGC TCGATGAACC TACCAACGAT CTTGACGTGG AAACGTTAAG AGCGCTTGAA
GACGCCATCG AGGATTTCGC CGGCTGCGCC GTGGTCATCT CCCACGACCG CTTCTTCCTC
GACCGCCTCT GCACCCACAT CCTCGCCTTC GAGGGCGACG CCCATGTGGA ATGGTTCGAG
GGGAACTTCG AAGCCTACGA GGAAGACAAG GCACGGAGAC TGGGGCCGGA TGCCCTCGAA
CCCAAGCGCG TGAAATACAA GAAATTCACC CGTTAG
 
Protein sequence
MAAYQYVYHM DGVSKTYPGG KKCFENIRLS FLPGVKIGVV GVNGAGKSTL MKIMAGLDTD 
FTGEAWAAEG ARVGYLPQEP ALDETLTVRE NVMLGVAPKK AILDRYNELA MNYSDETADE
MAKLQDEIDA QNLWDLDAQI DIAMEALRCP PDDASPANLS GGERRRVALC KLLLEAPDML
LLDEPTNHLD AETIAWLQKH LIEYKGTILI VTHDRYFLDD ITGWILELDR GRGIPYEGNY
SAWLDQKAKR LEREAKEDKA KQKTLARELE WIRAGAKARQ AKQKARINAY EELAGQSERE
KVGKAQIIIP NGPRLGSKVI EVENLTKAYG DKLLIENLSF SLPPGGIVGV IGPNGAGKST
LFRMLTGQEQ PDGGTLSYGD TVQLAYVDQS RDTLDPAATV WEEISGGGEI IELGDAQINS
RAYCGAFNFK GGDQQKKVGL LSGGERNRVH MAKLLKSGGN VLLLDEPTND LDVETLRALE
DAIEDFAGCA VVISHDRFFL DRLCTHILAF EGDAHVEWFE GNFEAYEEDK ARRLGPDALE
PKRVKYKKFT R
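
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it uses the standard codon-to-amino-acid assignments, on the understanding that translation table 11 assigns the same amino acids and differs in which start codons are permitted. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
import itertools

# Standard codon assignments in TCAG order (third base varies fastest).
# Assumption: table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCAGCCT ATCAGTACGT CTACCACATG GACGGCGTGT CCAAGACCTA TCCCGGCGGC"
last_line = "CCCAAGCGCG TGAAATACAA GAAATTCACC CGTTAG"

print(translate(first_line))  # MAAYQYVYHMDGVSKTYPGG
print(translate(last_line))   # PKRVKYKKFTR
print(gc_fraction(first_line))
```

The two translations match the first 20 and last 11 residues of the protein sequence. Applying `gc_fraction` to the full 1656 bp sequence would allow the same kind of check against the reported GC content.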