Gene Dshi_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2023 
Symbol 
ID5713018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2142820 
End bp2144499 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content68% 
IMG OID641267947 
Productputative ABC transporter permease component 
Protein accessionYP_001533363 
Protein GI159044569 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.967123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0816083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCC TGTCCGATCC CTCTGGCGAG CGTCCCGTGA AAAGCCGTGC CCGCAAGCGC 
CGCGAGGCCC GGCTGCTGGG CACGGGCGCG CTCGTGCTCG CGGCGCTCTG CCTGCTGCCC
ATGCTCGCGG TCCTGATCAC GGCGCTGTCG GGGGGGACCG ATACCCTGGC CCAACTCGCC
GACACCGTGC TGCCCGGCTA TACCGGCGCA ACCCTGGCGC TGGTGGTGCT CGTCGGCACC
GGCACCTTCA TGATCGGGAC CTCCACAGCC TGGCTCATAT CGATGTACGA GTTCCCTGGT
CGCCGCTGGC TGGAGGTGCT GCTGGTCCTG CCGCTGGCCT TTCCGGCCTA TGTGCTGGCC
TATGCCTATA CCCATGTGCT CGACCATCCC GGCATCGTTC AGGCGACGCT GCGATCGGTG
ATGGGCTGGG GACCGCGCGA CTACTGGTTT CCCGAGATCC GCTCGCTGGG CGGGGCTGCG
GCCATGCTGA TCCTCGTGCT CTACCCGTAT GTCTATCTGC TCGCCCGCGC GGCTTTCGTG
CAGCAGAGCG CCACCACTTT CTTTGCCGCC CGCGCCCTGG GCCGCACGCC TTTCCGCGCC
TTCCTGGAGG TCTCCATGCC CATGGCGCGC CCTGCCATCG CCGCGGGCGT CCTGCTGGCC
ACCATGGAGA CCATCGCCGA TTTCGGCACC GTGTCCTATT TCGGCGTCCA TACCTTCGCC
ACGGGCATCT ATACCAGCTG GTTCAACATG GGGGACCGGG TGGCCGCGTC CCAGCTTGCC
CTCGGGCTTC TGGGCTTCGC GCTCCTGCTC GCGGTGCTGG AGCGTCAAAG CCGCGGTTCT
GCCAAGTACC ACGGCGGCAA GCGGCAGGAG GCCATGCCCC GCACCACCCT GACCGGCTGG
CACCGCTGGA GCGCGACGAT CCTGTGCGGC GCGCCGGTGC TTCTCGGCGT GGCCATCCCG
ATCGTCACCC TGCTGGTCAT GGGCATCGGG TCCGAGCAGA ACCTGCTCAG CCGCCGCTAC
ATTCGTTTCA TCACGAATTC CCTGACCCTG GCCTCGGCGG CGGCGGTTCT GACGGTCTGC
GCGGCGGTGA TCCTGGGGTA CTACCAACGC GTCCGCCCCG GCCCGCGCTC GGACGCGGCT
CTCTATATCG CGCGGCTCGG CTACGCGATC CCGGGCGGGG TGATCGCGGT AGGGCTTCTG
GTGCCCTTCG CGCTCTTCGA CAACACGCTC GATGCCTGGA TGCGCGCCAA TTTCGACCTC
TCCACGGGGC TGCTGCTGAC CGGATCGATC TGGCTTCTGG TGGGGGCCTA CATGATCCGG
TTTCTCGCCG CGGCGCTGGG CGCCTACGAG GGCGGACAGG CGACGATCAA CCTCAATCTC
GACTATGCCG CGCGGGTGCT GGGCCAGACC GCCTACGGCA CCCTGCGTCG GGTGCACCTG
CCGATCCTGA CGCCAAGCCT GCTGACGGCG CTGCTGATCG TGTTCGTCGA CGTTATGAAG
GAATTGCCCG CGACGCTCAT CATGCGGCCC TTCAACTACG ACACGCTGGC GGTGCAGGCC
TACCGGCTGG CCTCAGACGA ACGGCTCGAA GGGGCGGCCG TGCCCAGCCT GCTGATCGTG
GCCGTGGGGC TCTTGCCGGT TATCCTGCTC TGCCGCCAGG TCCGCCGCCA ATCGCGCTGA
 
Protein sequence
MATLSDPSGE RPVKSRARKR REARLLGTGA LVLAALCLLP MLAVLITALS GGTDTLAQLA 
DTVLPGYTGA TLALVVLVGT GTFMIGTSTA WLISMYEFPG RRWLEVLLVL PLAFPAYVLA
YAYTHVLDHP GIVQATLRSV MGWGPRDYWF PEIRSLGGAA AMLILVLYPY VYLLARAAFV
QQSATTFFAA RALGRTPFRA FLEVSMPMAR PAIAAGVLLA TMETIADFGT VSYFGVHTFA
TGIYTSWFNM GDRVAASQLA LGLLGFALLL AVLERQSRGS AKYHGGKRQE AMPRTTLTGW
HRWSATILCG APVLLGVAIP IVTLLVMGIG SEQNLLSRRY IRFITNSLTL ASAAAVLTVC
AAVILGYYQR VRPGPRSDAA LYIARLGYAI PGGVIAVGLL VPFALFDNTL DAWMRANFDL
STGLLLTGSI WLLVGAYMIR FLAAALGAYE GGQATINLNL DYAARVLGQT AYGTLRRVHL
PILTPSLLTA LLIVFVDVMK ELPATLIMRP FNYDTLAVQA YRLASDERLE GAAVPSLLIV
AVGLLPVILL CRQVRRQSR