Gene Franean1_2161 details


Gene Information

Locus tag: Franean1_2161
Symbol:
ID: 5670561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2591904
End bp: 2593292
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 72%
IMG OID: 641241082
Product: glycosyl transferase family protein
Protein accession: YP_001506503
Protein GI: 158313995
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
TIGRFAM ID:
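
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither is stated in the record), and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2591904, 2593292)
print(length_bp, protein_length(length_bp))  # prints "1389 462"
```

Under these assumptions the computed values agree with the listed gene length (1389 bp) and protein length (462 aa).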


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGGAG CGGGAGCCGC GGCGTTGCTG GTAACCCTCG CGGCCACACC GGTCGTGCTC 
GGCGCCATGC GACGCCTGGC CGCCATCGAC GATGTGAACG AACGGTCCTC ACACCAGGTA
CCGACCCCGC GGGGCGGCGG CATCGCGGTA GCCCTCGGCC TCTTCGCCGG CGTAGTCGTC
CTCATGCTCG CCAGCGGTCA CGGCGACGCC CCCGACCTGC TCCCGATGAC CGTCGGGGTG
ACACTGTTCG GTCTCATCGG TCTCGCCGAG GACATCGGCG GCGACGTGGG CGGCATAGCC
CCACTGCGCC GACTGGCACT GCAGCTGCTC GCCGGCCTGG CGGTGTCGAC CCTGCTCCTC
ACCAGCGCGT CCCTCGACGC CGTGTCGATC CCACCCGTGG TGGTGCTGTC TGCCGCCGCA
CTGATCGGCC CGCTGTGGGT GACGGGGTTC GTCAACGCCT TCAACTTCAT GGACGGGATC
AACGGCATCT CGGCCGCGCA GGCCGCAGTC GCCGGCGGCG CCCTCGTCGT CGTGGGCCAT
CTCCACGACG CGCCGGCCCT GGCTGCGGGC GGCGCCGTTC TTGCTGGGGC GGCGATCGGC
TTCGCGCCGT TCAACTTCCC CCGCGCGCGG ATCTTCCTGG GCGACGTCGG CAGCTACACC
CTCGGCGCCA GCCTCGCCGT ACTCGCGTTG CAGGGCGTCA TGTCCGGCAT CCCGGCCGAG
GCGGTGCTCG CGCCGATACT GCTCTACCTC GCCGACACCG GCGCCACGCT GGTGCGTCGG
GTCCGGCGGG GCGAGCGTTG GTACCTGCCC CACCGCACTC ACACCTACCA GCGGCTCACC
GACGTGGGAT GGACGCACAC CCGGGTCACG TTGACGGTTG CCGGTCTCGT CGCGGCGATG
TCGGGTCTGG GGCTGCTTGG CACGCGGGGT GGCACGGGTA GCCGAGTTGT GGCCGACCTC
GGACTACTCG CCCTGGCCAC GGGCTACCTG AACGCTCCCC GGCTGATCGC ATCCGCCAGC
GCGCGGATAC ACCCAGGACA GCCCGCTGGC ACCCGTCCGG CGTCGCCTCC AGGCCCTGTC
GCAGAGCCCG ACCCAGCCGC CGCACCACCA CGTGCCGCAC CACCACTCGG CGCGTCACCA
CTCGGCGCGT CACCGCCCGG CAGTCCACCG GCCGCCATCG CGGTGATAAA TGCCATGGCA
GTAGGTCGGG CGCCGGCGGG GGACGGCGCC GGGAGGCTCG TTCTGCCACG CCAGCGACAG
GCGGGCGATC CAGGGCTCAC CACCGATCCG CCCGAACCCG CCCGGATGAC CCAGCTACCC
GACCAGGCCG GCGCCCGGCG ATACGGAGAT CAGGGAGATA CGGGAGATAC GGAGATCAAA
GACGCCTGA
 
Protein sequence
MLGAGAAALL VTLAATPVVL GAMRRLAAID DVNERSSHQV PTPRGGGIAV ALGLFAGVVV 
LMLASGHGDA PDLLPMTVGV TLFGLIGLAE DIGGDVGGIA PLRRLALQLL AGLAVSTLLL
TSASLDAVSI PPVVVLSAAA LIGPLWVTGF VNAFNFMDGI NGISAAQAAV AGGALVVVGH
LHDAPALAAG GAVLAGAAIG FAPFNFPRAR IFLGDVGSYT LGASLAVLAL QGVMSGIPAE
AVLAPILLYL ADTGATLVRR VRRGERWYLP HRTHTYQRLT DVGWTHTRVT LTVAGLVAAM
SGLGLLGTRG GTGSRVVADL GLLALATGYL NAPRLIASAS ARIHPGQPAG TRPASPPGPV
AEPDPAAAPP RAAPPLGASP LGASPPGSPP AAIAVINAMA VGRAPAGDGA GRLVLPRQRQ
AGDPGLTTDP PEPARMTQLP DQAGARRYGD QGDTGDTEIK DA
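
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch rather than the database's own method: it builds the codon assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs only in which codons may act as start codons; this gene starts with ATG, so no special handling is included). The function names are hypothetical, and the code assumes unambiguous A/C/G/T input with the 10-base grouping used above.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGCTCGGAG CGGGAGCCGC GGCGTTGCTG"))  # prints "MLGAGAAALL"
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and GC content.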