Gene Noca_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2022 
Symbol 
ID4598644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2167558 
End bp2168559 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID639776626 
Productextracellular solute-binding protein 
Protein accessionYP_923219 
Protein GI119716254 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0434097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGCG TACCGGCCCT CCTCGCCCTG TCCCTCGCAG CCACGCTGCT CGCCACCGGC 
TGCGGCAGCG ACAGCGACGC CCTGGTCATC TACAACGCCC AGCACGAGGA GCTGATGACC
GACGTCGCCA AGGCCTTCAC CGACGAGACC GGCATCGACG TCGAGCTGCG CAACGGCAAG
GACCTGGAGA TGTCCGCCCA GCTGGTCGCC GAGGGCAAGG CCTCGCCCGC CGACGTGTTC
CTGACCGAGA ACTCCCCCGC CATGGCGCAG GTCGAGAACG CCGGCCTGTT CACCGAGCTC
CCGCAGGACG CGGTCGCCCC GATCCCGGCG ATGTACCGGC CACGCAGCGG GCTGTGGACC
GGCTTCGTGG CCCGCTCGAC CGTGCTCGTC TACAACACCG ACCAGGTCTC CGCGGACGAG
CTGCCCGACT CGATCCTCGA CCTCGCCGAC CCCGAGTGGA AGGGCCGGAT CTCCTTCTCC
CCCACCGGCG CGGACTTCCA GGCGATCGTC GCCGCGGTCC TCGACCTCGA GGGCGAGCAG
AAGACCCGCG CCTGGCTGGA GGGCATCAAG GCCAACGGCA CCGTGTACGA CGGCAACAAC
GTCGTCCTCG AGTCGGTCAA CTCCGGCGAG TCCGAGGTCG GGATCATCTA CCACTACTAC
TGGTACCGCG ACCAGGCCGA GTCGGGCGAC GTCTCCGACC ACAGCGCCCT GTACTTCTTC
GGCCACCAGG ACCCCGGCGC GTTCGTGAGC GTCTCCGGCG CCGGCATCCT CGCCTCCAGC
GACCACCAGG CGGACGCGGA GAAGTTCGTG TCCTACCTGA CCAGCACCGC CGGCCAGCAG
GTGCTCGCCG ACAGCTACGC GCTGGAGTAC CCGCTCAACC CCGACGTCCA GCTCGACCCA
CCGGTCAAGC CGTTCGCCGA GCTCGATCCG CCCCAGGTCA ACGTCTCGGA CCTCGACGGC
AAGGCCGTGG TGGACCTGAT GACCGAGGTC GGGTTCCTCT GA
 
Protein sequence
MKRVPALLAL SLAATLLATG CGSDSDALVI YNAQHEELMT DVAKAFTDET GIDVELRNGK 
DLEMSAQLVA EGKASPADVF LTENSPAMAQ VENAGLFTEL PQDAVAPIPA MYRPRSGLWT
GFVARSTVLV YNTDQVSADE LPDSILDLAD PEWKGRISFS PTGADFQAIV AAVLDLEGEQ
KTRAWLEGIK ANGTVYDGNN VVLESVNSGE SEVGIIYHYY WYRDQAESGD VSDHSALYFF
GHQDPGAFVS VSGAGILASS DHQADAEKFV SYLTSTAGQQ VLADSYALEY PLNPDVQLDP
PVKPFAELDP PQVNVSDLDG KAVVDLMTEV GFL