Gene Dshi_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1942 
Symbol 
ID5712936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2031291 
End bp2032652 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content70% 
IMG OID641267867 
Productputative GSCFA family protein 
Protein accessionYP_001533284 
Protein GI159044490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.562926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000963078 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACATCCA ACCCGATCGA GACCGGCCCC GCACCAGAGG TGCTGCGCCG GGCGTCCCGC 
AACCCGCTGC GCCGCTACCC CACGCCCGAC GCCGGGGGGG ACCGGCTCTA CCCGCTGGCC
ATGCCCGCGC CGACCCCGTC CTTCGAATTC AGTTCAAAAG AAACCGTCTT CGCCCTCGGC
TCCTGCTTTG CCCGCAACAT CGAGGACGCG CTGGCCGCCG AAGGCTTCCG CGTGCTCAGC
CGCGAGTTCG ACCTGGGCGA GATCGGCGCG AGCCTCGATG ATGCCGCCAA TTTCTTCAAC
AAGTATACCA TCCACTCCAT GCTGAACGAG CTGCGCTGGG CCTTCGACCG CGACAGCTTC
CCCGGCGCGG ACATGCTCTA CCCCCTGCGC GACGACGACA CCGCCTACCG CGACCTGCAA
CTGGGCTCGG CCAAGCTCGC CTTCCCGCGC GACCGCATCC TCGCCTTCCG CCACCGGTTT
CTGGATGCCG TGGCCCAGAT CGCCGAGGCC GACGTGGTGG TGATGACCCT GGGCTATGTC
GAAGGCTGGC GCGACACCCG GCTCGACCTG GCGCTCAACA CCGCGCCCCC TCCGGCACTC
TGCGCCCGCG AGCCCGACCG CTTCGCCTTC GAGGTGCTGA GCTACGAGGA TGTGCTGGGC
GGACTGCGCG CCTTCCACGC GCTGCTGACC GCCCACCGCA CCAAGCCGCT CAAGATGCTG
CTGACCGTCT CGCCCGTCCC GCTCCTGAGC ACCTTCCGCG ACATGGACGT GCTGGTCGCC
AACTCCTACT CCAAGGCCGT GCAACGCGCC GCGGTCGAAA CCTTCGTCGC CGAAACCCCC
GGCGTCGACT ACTTCCCGTC CTACGAATGC GTCACCCTGA GCGACCCGGC CGCGATCTGG
ACCGAGGGCG ACTTCCGCCA CGTCGCCCCC GATCTGGTCA CCCGCATCAT GTCCAGCGTC
CTGACCGCCT ATGTCCCCGG CTGGGGCGAT AAGGGCGCGC TTACCCGCGC GGCGACCCGC
GCCACCACGC GGCTTCTGCT CGGCGCCGGA CGCCATGACG AGCTTCTGGC GCTGCTCGAC
GCCCACGGCC CCACGGACGA TGCCGAGCTG ACCGCCGCCC ACGCCCTCGC CCTGCGCCGC
ACCGACCGCA CCGCGCAAGC CGTGGCCCTG ATGTGCGAGG TGGTCGAACG CACCCCCGAC
GACCCCCAGC CCCTCGAACG GGTGATCCGC TGGTGCGAAC AACTCGACCG CATGGCCGTG
GCCCGCGACT ACCTCGACCT GCACGCCCAA CGCTTCCCCA AGCGCCGCAA GTTCCGCCGG
GGCCGCAAGT GCCGCAAGGC CGCCAACCGG GGCCGCGGCT GA
 
Protein sequence
MTSNPIETGP APEVLRRASR NPLRRYPTPD AGGDRLYPLA MPAPTPSFEF SSKETVFALG 
SCFARNIEDA LAAEGFRVLS REFDLGEIGA SLDDAANFFN KYTIHSMLNE LRWAFDRDSF
PGADMLYPLR DDDTAYRDLQ LGSAKLAFPR DRILAFRHRF LDAVAQIAEA DVVVMTLGYV
EGWRDTRLDL ALNTAPPPAL CAREPDRFAF EVLSYEDVLG GLRAFHALLT AHRTKPLKML
LTVSPVPLLS TFRDMDVLVA NSYSKAVQRA AVETFVAETP GVDYFPSYEC VTLSDPAAIW
TEGDFRHVAP DLVTRIMSSV LTAYVPGWGD KGALTRAATR ATTRLLLGAG RHDELLALLD
AHGPTDDAEL TAAHALALRR TDRTAQAVAL MCEVVERTPD DPQPLERVIR WCEQLDRMAV
ARDYLDLHAQ RFPKRRKFRR GRKCRKAANR GRG