Gene Dshi_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2021 
Symbol 
ID5713016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2141284 
End bp2142312 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content65% 
IMG OID641267945 
Productputative binding protein component of ABC iron transporter 
Protein accessionYP_001533361 
Protein GI159044567 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.104601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTAC GACTTTCTCT GCCGCTTGCC GGTCTGACCG CCGCGGCCAC GCTGCTCGGT 
TCCGTGGCCT ATGCTGCGGG TGAGCTGAAC CTCTATTCGT CGCGTCACTA CGACACGGAC
GAGCGGCTCT ACTCGGATTT CGAAGAGGCC ACGGGCATCA CCGTAAACCG GATCGAGGGT
AACGCCGACG AACTGATCGC GCGGATGGAG GCCGAGGGCG CCAACAGCCC GGCGGACGTA
TTCCTGACCG TGGACACGGT GCGTCTGGCA CGGGCCAAGG ATCTCGGCCT GCTGCAATCG
GTGGACAGCC CGATCCTCGA GGGGCGCATC CCGGCCTACC TGCAGGATGA CGACAACCAG
TGGTTCGGCT TCTCGCAGCG CGCGCGCATC CTGTTCTACG ACAAGACCGA CGTGGAAAAC
CCGCCGGCCA CCTATCAGGA CCTGGCGAAG CCGGAATATG AGGGCATGGT CTGCATCCGG
TCCTCCACCA ACGTCTATAC CCAGAACATC GTCGCGGCCC TGATCGAGCA TCTGGGCGAA
GAAGCGGTGA CCGACTGGGC CAAGGCCGTG GTCGGCAACT TCGCCCGCGC GCCTCAGGGC
GGCGATACCG ATCAGCTGCG CGGCATCGCC TCGGGCGAGT GCGACATCGC GATGTCGAAC
ACCTATTACT ACGCCCGCGC GACCCGGAAG GGCGACAGCA CCATGTCCGA GGAAGACCTC
GCAAATATCG GCTGGGTGTT CCCGAACCAG AACTCGATCG GGGCGCATAT GAACATCTCC
GGCGGCGGGG TGGCCGCGAA CGCGCCGAAC CGCGACAACG CGGTGAAGTT CCTCGAGTAC
CTGTCGTCCG TGCAGGCGCA GGAGTATTTC TCGGCCGGCA ATGACGAATA TCCCGCGGTG
CCCGGTGTTG GCCTTTCGCC GTCGGTTGCG GCCCTCGGCA TCTTCCGTCC GGACGTGATC
GACCTGTCGG CCATCGGCAA CAATGTCGAC GCAGCCCAGC GCGTGCTGAC CGCGGCCGGC
TGGGAGTAA
 
Protein sequence
MPVRLSLPLA GLTAAATLLG SVAYAAGELN LYSSRHYDTD ERLYSDFEEA TGITVNRIEG 
NADELIARME AEGANSPADV FLTVDTVRLA RAKDLGLLQS VDSPILEGRI PAYLQDDDNQ
WFGFSQRARI LFYDKTDVEN PPATYQDLAK PEYEGMVCIR SSTNVYTQNI VAALIEHLGE
EAVTDWAKAV VGNFARAPQG GDTDQLRGIA SGECDIAMSN TYYYARATRK GDSTMSEEDL
ANIGWVFPNQ NSIGAHMNIS GGGVAANAPN RDNAVKFLEY LSSVQAQEYF SAGNDEYPAV
PGVGLSPSVA ALGIFRPDVI DLSAIGNNVD AAQRVLTAAG WE