Gene Francci3_3613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3613 
Symbol 
ID3904167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4313582 
End bp4315003 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content69% 
IMG OID637880934 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_482694 
Protein GI86742294 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGCGC ACAGCTCGGG AGTGGTACCA TCCACCAGCA CGGACCGACG GTGGTATCGC 
ATCGTCGGCG GCGCTCCGCT CACCGGCTCG GTCGAAGCGT CCGGGGCCAA GAACTCGGTC
ACCAAGCTGC TCGTCGCCAC CCTCCTCACG CCAGAGCCGT GCACCCTGAC CCGGGTGCCT
CGGATCGCCG AGGTCGACGT GGTCCTGGGC ATGCTCGCCG AACTCGGCAC CCAGGTGGAG
TGGCTGGACG AGCACACTGT GCGACTCGCC ACCGAAAAGA TCATCGACGC GTCGCTGTCC
GAGGCCTACT CCGGGGTCAA CCGGATACCG ATCCTGATGA TGGGCCCGCT CCTGCACCGC
GTCGGCGAAG CCGTGATCCC GCTGCCCGGC GGTTGCCGGA TCGGCAAGCG TCCCGTCGAC
TTCCACCTCG CCGGCCTGCG TGCCATGGGT GCGGAGATCG TCGAGCAGCT GCGTTCCGTC
CAGGTCAAGA CGCTGGGTCT GCACGGCACC CATGTCGCGC TGCCGTTCCC GAGTGTCGGC
GCGACCGAGA ACCTGCTCCT GGCCGCGGTC CGGGCGCAGG GTACGACCGT CATCTCGAAT
GCGGCGGTGG AACCGGAGGT CATCGACCTC ATCATGTTCC TGCAGCAGAT GGGCGCCCTC
GTCGACGTGG AAGTCGACCG CACGATCGTG GTGCAGGGGG TCGACGCACT GCGGGGAGCC
ACCCATGCCC CGATCCACGA CCGCATCGAA GCCGCGTCCT TCGCCTCGGC CGCGGTGGCC
ACCAACGGCA GGGTGGAGGT GGTCGGTGCC CGCCAGGAAC ATCTGGCGAC CTTCCTCAAC
CACCTCCGCC GGCTCGGTGG CGAGTTCGAG GTCACCCCGC GGGGCATGAC CTTCTTCCGG
GCCCAGCCGT TGACCGCGTC ACATGTGCAG ACCGACGTAC ACCCCGGGTT CATGACCGAC
TGGCAGCAGC CCCTGGTCGT CCTGCTCACC CAGGCCAAGG GTGCCTCCGT GATCCACGAG
ACGATCTACG AGGACCGGTT CGGATACACC AGACAACTGG CCGAGATGGG CGCCGACATC
GCCCTGTCGA CCCTGTGCCT CGGTGGGAAG GCCTGCCGGT TCGCGTCCCG GGACTTCGAG
CATTCGGCGG TCGTGAGCGG GCCGACCTCC CTCACCGGCG GTGATCTGGC GATCCCCGAC
CTGCGCGCCG GCTTTGCCTA CGTGCTGGCC GCGCTGGTGG CCGACGGCAC CAGCATCATC
CGCGGGACCC GTTTCCTCGA ACGCGGTTAC GAGGACCCGG TGGGTAAGCT GCGCTCGATC
GGAGCGTTCA TTGACACGCA GGCCGTGGGT TCGCCCCCGG CACCGACCAC GCCCCGACCT
GACCGGAACG ACGGTGCTGC CGGGCCAGGC GCATGCAGGT GA
 
Protein sequence
MGAHSSGVVP STSTDRRWYR IVGGAPLTGS VEASGAKNSV TKLLVATLLT PEPCTLTRVP 
RIAEVDVVLG MLAELGTQVE WLDEHTVRLA TEKIIDASLS EAYSGVNRIP ILMMGPLLHR
VGEAVIPLPG GCRIGKRPVD FHLAGLRAMG AEIVEQLRSV QVKTLGLHGT HVALPFPSVG
ATENLLLAAV RAQGTTVISN AAVEPEVIDL IMFLQQMGAL VDVEVDRTIV VQGVDALRGA
THAPIHDRIE AASFASAAVA TNGRVEVVGA RQEHLATFLN HLRRLGGEFE VTPRGMTFFR
AQPLTASHVQ TDVHPGFMTD WQQPLVVLLT QAKGASVIHE TIYEDRFGYT RQLAEMGADI
ALSTLCLGGK ACRFASRDFE HSAVVSGPTS LTGGDLAIPD LRAGFAYVLA ALVADGTSII
RGTRFLERGY EDPVGKLRSI GAFIDTQAVG SPPAPTTPRP DRNDGAAGPG ACR