Gene Franean1_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1856 
Symbol 
ID5670258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2228430 
End bp2230109 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content74% 
IMG OID641240777 
Productglycosyl transferase family protein 
Protein accessionYP_001506200 
Protein GI158313692 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0355886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0028209 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAACGACT CGCGCACCAC CGGCGGTCCG CTGCCGGAGA CCGCCGGCCC GCTCGCCGCC 
GGCGGCCTCC CCGGCGCCCG CGCCGATGCC ACCGTCGCCG ATGTAGCCGT CGCCGGCACC
GCTGTCACCG GCACTACCGT CGCCGGCACT GCCGTCGCCG ACTCCGCTGA TGACGGTGAT
CCCGCCGACC GCCTCGACGT CTCCGTCGTC ATGCCGTGCC TGAACGAGGC CGAGTCGGTC
GGCGTCTGCG TCCGCAAGGC GCTGGCCGGG CTGGCCGCCG CCGGAGTCGC GGGCGAGGTC
GTCGTCGTCG ACAACGGCTC GACCGACGGC TCCGCGGCGG TGGCGACCGC GGCCGGCGCG
CGCGTCGTCG CCGAGTCACG GCGTGGCTAC GGCAACGCCT ATCTCGCCGG CTTCGCCGCC
GCGCACGGCC GGTTCCTGGT CATGGGCGAC TCCGACGACA CCTACGACTT CGCCGACCTC
GGCGCGCTGC TCGCCCCGCT GCGCGCCGGG CGCGCCGACT ACGTGCTGGG TTCCCGGTTC
GCCGGTGAGA TCCTGCCCGG CGCCATGCCC TGGCTGCACC GATACGTCGG CAACCCGCTC
CTCACCGGCA TCCTCAACCG CCTGTTCGAC GTCCGCTCGT CCGACGCCCA CTCCGGGATG
CGGGCCTTCA CCAGGGACGC CTACCGGCGG ATGCGGCTGC GCTGCGAGGG CATGGAGCTC
GCCTCCGAGC TCGTCATCGC CGCCCGTCGA GCCGAGCTGC GGATCGAGGA GGTGCCGATC
ACCTACCACC CGCGGGTCGG GGCGTCGAAG CTCCACTCAC TGCGGGACGG CTGGCGCCAC
CTGCGGTTCA TGCTGCTGCT GGCGCCCAGG CACCTGTTCG TCCTGCCGGG TCTGGTCCTG
TTCGGGCTGG GCACGGCCGG CCAGCTGGCG CTCCTGCCCG GTTCGCTCGA TGTCGGGTTC
CACCGGCTCG ACCTGCACTT CTCCGTGCTG TTCGCACTGA TCGCGATCCT CGGTTGGCAG
TTGGTGCTTC TCGGTGTCTT CGCCGACGTC CACAACCATG CCGCGGGGTG GCAGGAGCGC
CGCCGCTGGC CGCTGACGTC GATCCACCGG CGTTTCACGC TCGAGCGGGG CCTGGCGGCC
GGCGGGATCC TGTTCACCGT CGGCTTCGCG ATCGACTGCG TCATACTCGC CCGATGGCTG
GCGAACTCGA TGGGGCCGCT CAACGAGCTG CGCCCCGCCC TGCTCGCCAT GTCGCTGATG
GTGCTCGGCG CGCAGACCGC CTTCGGGTCG TTCTTCCTGC GGCTGGTGAC GGCCGGGCCG
AGCGGCGGCC ACCGCCGGGC CGGCTGGGCG CCCGCGACCG GACTCGCCGT CTCAGCCGCG
GCCACCGGCG CGTCGCCATC CGCAGTCTCG CCGTCCTCAG TGTCGCCGGC CGATCCCCCG
CCGGCCGCGG CCCCGCCGGC CGCGAGGCCG GTGGCGGCCG AGCCGGGCGA CGACGCGGCG
GCTGGTGACC GCGCCCCCGG GCCGGTCGGG GCGCCGGGCC CGGTCGGGAC GCCGACGCCA
GACGGCGGAG CCGGTCCGGC TACCCGCACG GGCCCGGAAA GTCATGAGAA CGACGAGGAT
GCGGCGTTCC TCGCCGCGCC CGCACCCGTC CTCGGTGGAA TTCCCGCCCA CCAGCGCTGA
 
Protein sequence
MNDSRTTGGP LPETAGPLAA GGLPGARADA TVADVAVAGT AVTGTTVAGT AVADSADDGD 
PADRLDVSVV MPCLNEAESV GVCVRKALAG LAAAGVAGEV VVVDNGSTDG SAAVATAAGA
RVVAESRRGY GNAYLAGFAA AHGRFLVMGD SDDTYDFADL GALLAPLRAG RADYVLGSRF
AGEILPGAMP WLHRYVGNPL LTGILNRLFD VRSSDAHSGM RAFTRDAYRR MRLRCEGMEL
ASELVIAARR AELRIEEVPI TYHPRVGASK LHSLRDGWRH LRFMLLLAPR HLFVLPGLVL
FGLGTAGQLA LLPGSLDVGF HRLDLHFSVL FALIAILGWQ LVLLGVFADV HNHAAGWQER
RRWPLTSIHR RFTLERGLAA GGILFTVGFA IDCVILARWL ANSMGPLNEL RPALLAMSLM
VLGAQTAFGS FFLRLVTAGP SGGHRRAGWA PATGLAVSAA ATGASPSAVS PSSVSPADPP
PAAAPPAARP VAAEPGDDAA AGDRAPGPVG APGPVGTPTP DGGAGPATRT GPESHENDED
AAFLAAPAPV LGGIPAHQR