Gene Noca_0068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0068 
Symbol 
ID4600119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp76344 
End bp77522 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content59% 
IMG OID639774679 
Productinulin fructotransferase (DFA-I-forming) 
Protein accessionYP_921301 
Protein GI119714336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA CCGTTTACGA CGTCACCACC TGGACCGGCG CCACCGTCTC GCCTTACACC 
GATATCGGCC TGGTCATCAA CCAGATCATC GCTGACATCA AGAGCCAGCA GAACAGTCAG
ACGACCCGCC CTGGCGCCGT CATCTACATT CCTCCCGGTC ACTACACCCT GCAGACGACA
GCCAACATCG ACATTGGCTT CCTGACGATC AAGGGCTCCG GGCATGGCTT CATGTCCGAA
GCAATCCGCG ACGACGTGAA CCACTCGGCC TGGGTCGAGA CTCTCCCGGG CTCAAGCCAT
GTGCAGATCG CGAACAACAA CCAGGTCGGG TTCCTCGTCA ACCGGGCCAC CGACCCAGGG
ACGAACGGCC GACTCAACTC GATCGTCTTT CAGGACTTCT GCATCGACGG CGTCAGCAGT
ACCAAGCCCT ACATTCCTGG AAACGGAAAG ATCGGCATCA AGTCGCAGTA CGACACCGAC
TCCCTTCGAA TCGAGGGGAT GGGCTTCGTC TACCTCAACA CAGCACTGAC CATCCGCAAT
GCCGACGCCT TCAACATCAC CAACAACTTC ATCGCTGAGT GCGGCAGCTC CATCCAGCTG
ACAGACAGCT CCATCGTCGG AAAGATCACC AACAACTACC TGATTAGCGC GTGGGCTGGG
AACTCCATCT TCATCGAGAA CAACGAGGAC TGTGTCATCA GCGGGAACAG TCTCCTGTGG
GGCGCTCGGA TCCAGATGAA GAATGTCCAT CGCGCAGTGA TTACTGGCAA CAAGTTCGTC
AGCAACTTCT CCGGAATGAT CGTTCACGAA ACTCCGTGCC ACGAGCAGCT GATCTCGGGT
AACCACTTCC GTCGCAAGTA CGGCGACGGT GGCCCTGCCC GCAATGACGA TCTCTTCGGC
ATGGTCCATC TCAACGGCAA CGACAACTCC GTCACAGCCA ATCACTTCGC GTTCGAGGTG
CCGGCCGCCA ACATCGTCCC CTCCGGAGCC ACCCCCACGG TCGTCCTAGT CAAAGGTGGG
GCCCGCAACT TCCTCGCCAC AAACAAGTTG GCGTCAAACG TCGCGGTACG CCACGTGCTC
GACGCGAGTT CCACGGCCAC GAAGGTTCTC TGGTCGGGCA CGGCGGCCCA ACTCCAAGAT
CTGAGCGGGG GAAACATGTC CTTCGTGGCA ACGCCGTGA
 
Protein sequence
MATTVYDVTT WTGATVSPYT DIGLVINQII ADIKSQQNSQ TTRPGAVIYI PPGHYTLQTT 
ANIDIGFLTI KGSGHGFMSE AIRDDVNHSA WVETLPGSSH VQIANNNQVG FLVNRATDPG
TNGRLNSIVF QDFCIDGVSS TKPYIPGNGK IGIKSQYDTD SLRIEGMGFV YLNTALTIRN
ADAFNITNNF IAECGSSIQL TDSSIVGKIT NNYLISAWAG NSIFIENNED CVISGNSLLW
GARIQMKNVH RAVITGNKFV SNFSGMIVHE TPCHEQLISG NHFRRKYGDG GPARNDDLFG
MVHLNGNDNS VTANHFAFEV PAANIVPSGA TPTVVLVKGG ARNFLATNKL ASNVAVRHVL
DASSTATKVL WSGTAAQLQD LSGGNMSFVA TP