Gene Franean1_0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0337 
Symbol 
ID5668761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp403117 
End bp405732 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content71% 
IMG OID641239268 
Productglycosyl transferase family protein 
Protein accessionYP_001504709 
Protein GI158312201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0744] Membrane carboxypeptidase (penicillin-binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0174987 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCCAG CCGTTCCCCG TGTTCCCCGT GTACCCCGTG TACCCCGTGT TCGGAGCGTT 
CCCGGTGCCA CGGTGGGTCC AGGCGTGATC CGGTCGAGAC TCCGCGGCGC GGCCGACGCG
CGGTCCACGT GGCAATCAGG CCTGTCCGCG CCGGCCTCCG GCCTCGCCTC ACGGCATATC
GGGCCCGACT CGGCGCGCCG CCGCCGAGTC ACACCGAAAC GGCTGCTCGG CATCCTCACC
ACCAGCGTGG CGGCGGGCGT TCTGGTCGGC ATGCTCGCGC TGCCCGTCGT CGGCCTGGCC
GGCGTGACCG CCAAGGGCGG CGCGGACCAT TTCCTTTCTC TCCCCGCGAA TCTGACCGTC
CCCCCGCTGG CCCAGCCCTC ACGCATCCTG GACGCCGCCG GAAACCAGAT CGCGGTGCTG
CGCGGCGAGC AGGACCGCGA GATCGTCGCC CTGGACAAGG TCCCGCCGCA GATGCGCCAG
GCGATGATCG ACATCGAGGA CGCGCGCTTC TACGAGCACA CCGGTGTCGA CTACCGCGGA
ATGATCCGCG CGTACCTCGC CAACCAGGAG TCCGGCGGGG TCACCCAGGG CGGATCGACC
CTCACCCAGC AGTACGTGAA GAACGTCCTG CTGGCCTCCG CCCGGACACC GGAGGAAAAG
GCGGCCGCGA CCGAGCAGAC GGTCGACCGC AAGCTCCGCG AAGCCCGTTA CGCGCTGTAT
CTGGAAGAAC ACCTCACCAA GGACGAGATC CTCGAGCGGT ACCTCAACAT CGCCTATTTC
GGCGACGGCG CGTACGGCGT CCAGGCGGCC GCCCGGCACT ACTTCAACAT CGACGTCAGC
CAGCTCGGCG TCACCCAGTC GGCGATGCTG GCCGGCCTGG TGAAGAACCC CACCGCGTAC
AACCCGGCGC TTCACCCGCA GGCCGCCAGG GAACGCCGCA ACATCGTCCT CGACCGGATG
CACGAGCTCG GCCACCTCGA CGCCGCCGCG TGGAAGGCGG GCCGGGCCGA GGAACTCGTC
CTCGACCGCC CGGCGCGGTC ACCGGACGCC TGCCAGGACT CTTCCGCCCC CTTCTTCTGC
TCCTACGTCC GCCAGGTGCT GCTGGCCGAC CGGACGTTCG CCGCCACCCC CGAGGACGCC
CGCCGGCTGC TGTTCGAGGG CGGGCTCACC ATCCGGACCA CGCTCGACCC GGTCGCGCAG
GGCGCGGCGC AGACATCCGC ACGCGAGGTG ATCCCCACCG GGAACCGGGT CGCCGCCGGC
GTCGCGATGG TGCAGCCGGG CACCGGCAAC GTGCTGGCGC TCGCCGTGAA CCGGGATTAC
GGCACGCCGG ACGACAACCA GCCGCCGGCG CTGACCGCCG ACTTCGTCCA CACCAAGGAG
ATCTACCCGG TCGACCCCGA CTCGTTCTCG CCCGGCTCCA CGTTCAAGGT CTTCACCCTG
GCCGCAGCGC TGGAGAACAA CATCCCGCTG TCGACCACCT TCCACTCGCC GCTGTGCTAT
CACTCGGACC GCTTCCCCAA CCCGGATCCG GGCGGGAAGA ACTGCTACAG CAACGCCGAC
CCGAGCGAGG ACGGCTTCTA CTCCCTCACC ACGGCGACCT GGAACTCGGT CAACACCTAC
TACATCCAGC TCGCCGAGCG CCTCGGCGTC ATGAAGACCG CCGAGATGGC CCGCCGGCTA
GGCGTCTCCT CCTGCCGGAT CCGGCCGAAT GAAGAGAACG ACCCCGAGTG CAAGGGTGTG
GAGGGGATCG GTCCGGTCGA CGGCTCGGCG ATCCTCGGCT CGAACGAGAT CAGCACGCTG
GACCTCGCCA CCGCCTACGC GACGCTCGCC GCCCGCGGAA ACCGCTGCTA TCCCCGCACA
GTGCTCTCGA TCACACAGCG CGTCGGGACG GCTGACCGTC CGCTGGCCTT CAACACCGGA
AAGCCGTGCG AACAGGTCCT CGCCCCCGGG ATCGCCGACA CCGTCGCCTC GGTTCTCCAG
GGCGTCATCA ACCACGGGAC CGCGTCCGGG AACGCGCAGA TCGGACGTCC GGCGGCGGGC
AAGACCGGAA CGGCCGAGGC ATTCAGCACG GCGTCCTTCG CCGGTTTCAT CCCGCAGCTG
GCCACCGCGG TCACCCTCGC CGACCCGCGC GGCCCGACGA CTCATCAGCT TCGCAACGTG
CTCACCAGCC GTGTGGTGTT CGGCGGCGGA TTCCCGGCCC AGATCTGGGC CCGGACGATG
ACGCGCACGA TCGACGGCTT GGCGCTGCCC GTCATGCCAC TGCCGGCGCC GGACAACACC
CAGCCGCAGG TGCCCAAGAA GGCCCTGCCG GACGTCCGCG GGCAGAGCCA GCAGGCGGCC
GAGTACGCTC TGCGCTCGCT GGGATTCCGG GTCCGCTCGG AGACCGTGCC GCACGTGGCC
CCGCCCGGGA TCGTCGTCGG CATGGCTCCC GGGCCCGGGG AGGAGATCTC CATGGATACC
GAGGTGGTGC TCCAGGTGTC CGGCGGCCTC ACCGGCGCGG TGCTGCTGCC GAACGGCGCC
GGCGCACGCC CCGGCCAGCC CGCCGCCCCA CCGGCGGGGG ACGGCCGCGC AGGCATCCCG
GGGCTGCCGG ACCTGCGCAA CCAGCTCCGG AACTGA
 
Protein sequence
MHPAVPRVPR VPRVPRVRSV PGATVGPGVI RSRLRGAADA RSTWQSGLSA PASGLASRHI 
GPDSARRRRV TPKRLLGILT TSVAAGVLVG MLALPVVGLA GVTAKGGADH FLSLPANLTV
PPLAQPSRIL DAAGNQIAVL RGEQDREIVA LDKVPPQMRQ AMIDIEDARF YEHTGVDYRG
MIRAYLANQE SGGVTQGGST LTQQYVKNVL LASARTPEEK AAATEQTVDR KLREARYALY
LEEHLTKDEI LERYLNIAYF GDGAYGVQAA ARHYFNIDVS QLGVTQSAML AGLVKNPTAY
NPALHPQAAR ERRNIVLDRM HELGHLDAAA WKAGRAEELV LDRPARSPDA CQDSSAPFFC
SYVRQVLLAD RTFAATPEDA RRLLFEGGLT IRTTLDPVAQ GAAQTSAREV IPTGNRVAAG
VAMVQPGTGN VLALAVNRDY GTPDDNQPPA LTADFVHTKE IYPVDPDSFS PGSTFKVFTL
AAALENNIPL STTFHSPLCY HSDRFPNPDP GGKNCYSNAD PSEDGFYSLT TATWNSVNTY
YIQLAERLGV MKTAEMARRL GVSSCRIRPN EENDPECKGV EGIGPVDGSA ILGSNEISTL
DLATAYATLA ARGNRCYPRT VLSITQRVGT ADRPLAFNTG KPCEQVLAPG IADTVASVLQ
GVINHGTASG NAQIGRPAAG KTGTAEAFST ASFAGFIPQL ATAVTLADPR GPTTHQLRNV
LTSRVVFGGG FPAQIWARTM TRTIDGLALP VMPLPAPDNT QPQVPKKALP DVRGQSQQAA
EYALRSLGFR VRSETVPHVA PPGIVVGMAP GPGEEISMDT EVVLQVSGGL TGAVLLPNGA
GARPGQPAAP PAGDGRAGIP GLPDLRNQLR N