Gene Franean1_5320 details


Gene Information

Locus tag: Franean1_5320
Symbol:
ID: 5673654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6408825
End bp: 6411098
Gene Length: 2274 bp
Protein Length: 757 aa
Translation table: 11
GC content: 74%
IMG OID: 641244177
Product: hypothetical protein
Protein accession: YP_001509584
Protein GI: 158317076
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
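
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(6408825, 6411098)
print(gene_len, protein_len)  # 2274 757
```

Under these assumptions the result matches the listed 2274 bp and 757 aa.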


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.323329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.16375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCCAC CAGCCGAAGG GCGGTCCAGG CCGCTCGGTA CCGTGCCGAG GGTGAGCAGG 
TCGAGTGATC CCGGCGCCGG CGCGGTCTCC GCGCCGGACG AGCCGTCCGA CCGGATCGGC
AAGGCCGCCG CGGACCAGAT CGGCACGGTC GTCCCGGACC GCCTGGTCGT GACGCCTGAG
CCGCTGCCCC GGCGGCTGTG GCGTCGCTGC CAGGAGGTCA GCACGGCCGG CCGGTACGTG
GCGGTCCTGG CCTTGACCAG TGCGACGGTG GCCGTCCTGC TGCGGCACAA CCTGTTCCCG
TACCTGTCGG TCAACAACGA CGAGGTCATC TACCTCCTGC ACGCCCGGAC GCTGGCGGAC
GGCCACCTCT TTCCGTCTGC GCCTGATCCA GCCGCGTCGT ACGCACCCTG GCTCGCCGCC
ATCTCGGGTG ATCACTTCGT TCTGAAGTAC ACGCCGTTCG TCCCGGGGTT GTTCGCGTTG
GGCCTCATGC TGACGGGCAG CGTCTCGCCG GTGCTGGCGG TTATCGCGGC GGCGGCGGTG
ATCGTTACCT ACCTGCTCGG TGTCGAGCTG GCGGGGGAGC GGAGGGTCGC GGCGCTGGCC
GCGACGCTGC TGGCCCTCTC CCCGTTGGTG ATCGTGCAGA GCGCGCTGGT GCTGAGCTAC
CTGCCGGTGC TCGTGCTGAT GGAGCTGACG TTGCTCGGCC TGCTCCGAGG GCTGAGAGCC
GGCGGGCTGA GAGCCGGCGG GCTGAGCGAT GGCGGTCGGC GCTCCGCGCG GCACGGCGGG
CGGGCGCTTG CCGGAGCGGG GCTCGCGGTC GGCGTGGCCG TGGCGGTGCG GCCGTATGAC
GTGGTCCTCC TGCTCGCCCC GGTGGCGGTC TGGGGCGTCG TGACGGCGCG CAGGTCCGGG
CGGCTCGGGT GGGCCCTGCG CTGGACGGCT GCCGGGCTCG TGCTCCCGGC CGCGATCCTG
CTGGCGTCCA ATGCCGCGGC GACCGGCAGC CCGTTCAGGT TGCCGTTCGC GCTGCTCGAA
TCGGACGACA AGCTCGGGTT CGGCGTGCGC AGGCTGTACC CGTCGGACGG CGGGCACGAC
TTCGGGCTCG GTGACGGACT GGCGTCGGTC GGGGATCATC TCTGGCTGCT CGGTGGCTGG
GCCTGCGGCG GGGTGGTTCT CGCCGTCGCC GCGATCGTGG CGGCCGCGCG GCGCCGGCTG
AATGGCCCGG GGTACGCCCT CGGCGTCGGC ATGGTGCTGT TCGTGGTCGG TTACATCGGC
TTCTGGGGGG CGTGGAACGC GGCGGAGCTG TGGGGCGGCA TCCGATACGT CGGGCCGTTC
TACCTGATGC CTGTGCTGAT CGGGCTCGTG CACCTCGGTG CGCGCGGGCT GGTCGACCTC
GCCGGCTGGT CACGGCGGCG GGCGGCACGG ACGGTGACCG GGGTGTGCGC CGCGGGTGTC
GTCGGGCTGA CGACGTTCGT CCTCGTCGGC GCGATCGACG CGAACGCGAC GATGACGGAC
CACGACCGTG ACCTGGCGGC GATGCTGCGG GCCCTGCCCG GGCGGTCCCT GGTACTGGTG
GCCGCCAGCC CGCCCTACCT GGGCCATCCG AGCGGTGTCA CCACCAACGG GGCCGATCTC
GACGGTGCCG ACGGTGACGG GCCGTTGCTG TTCGCGGTCT CGCGGGGGGT GGCCGACCTG
GAGGTCGTCG CCGACCACCC CGACCGCACG CCGTACCTGC TCCGCATGCC GCCGGCCTAC
AACCGGTCCC CGGGCTCGGT GACGCGCTCG CGGGTGGACG CGCTCACGGT GGCCACCGGT
CGTACGGTCG GCGTCGAGGT CAGCGTGGAC GCGCCGCCGC GAGGGACGCG CGCGGCCCAG
CTGGTGTTCG AAGCCGGTGG GGTCCGGCTG ACCTACCCGG TCTCAGCGAA CGGGCCCGTG
ACCGCGCGGC TCACCCTTGA CGCGGACGGC CTCGACACCG ACGACGTCAC GGAAGTCGTG
ATCTACGGTG GCGGCGAGAC CAGGGGATCT CCGGCGGGCG CGGCCACCGG GCGCCCGGCC
GGCCGGGCGA AGATCACGAA GGTGCCCGGG GTGGGGACGT CCGTGACGGT GTCGCTGCTG
GCCATCCCGG CGTCCGGTGG CCGTGCGCGG ACCGTCGACC GGCAGGTGAT CCCCGTCCTC
GTCGAGGATC CCAGCGGCGA GACGCCGGGC GATGTGGCAG TCCTCGCACC GAGCGCCCAC
GTGGACGAGA CCGGCCAGGG CCCGCGGCCC GCCGTCCGCA TCGCGCTGTC CTGA
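
The GC content field can be recomputed from the gene sequence once the spacing between the 10-base groups is removed. The following is a minimal sketch with a hypothetical helper name, `gc_percent`. It assumes GC content means the percentage of G and C bases over all bases. The example call uses only the first row of the sequence, so its result is not the whole-gene figure.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste the full block for the gene-wide value.
first_row = "ATGGGGCCAC CAGCCGAAGG GCGGTCCAGG CCGCTCGGTA CCGTGCCGAG GGTGAGCAGG"
print(round(gc_percent(first_row), 1))
```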
 
Protein sequence
MGPPAEGRSR PLGTVPRVSR SSDPGAGAVS APDEPSDRIG KAAADQIGTV VPDRLVVTPE 
PLPRRLWRRC QEVSTAGRYV AVLALTSATV AVLLRHNLFP YLSVNNDEVI YLLHARTLAD
GHLFPSAPDP AASYAPWLAA ISGDHFVLKY TPFVPGLFAL GLMLTGSVSP VLAVIAAAAV
IVTYLLGVEL AGERRVAALA ATLLALSPLV IVQSALVLSY LPVLVLMELT LLGLLRGLRA
GGLRAGGLSD GGRRSARHGG RALAGAGLAV GVAVAVRPYD VVLLLAPVAV WGVVTARRSG
RLGWALRWTA AGLVLPAAIL LASNAAATGS PFRLPFALLE SDDKLGFGVR RLYPSDGGHD
FGLGDGLASV GDHLWLLGGW ACGGVVLAVA AIVAAARRRL NGPGYALGVG MVLFVVGYIG
FWGAWNAAEL WGGIRYVGPF YLMPVLIGLV HLGARGLVDL AGWSRRRAAR TVTGVCAAGV
VGLTTFVLVG AIDANATMTD HDRDLAAMLR ALPGRSLVLV AASPPYLGHP SGVTTNGADL
DGADGDGPLL FAVSRGVADL EVVADHPDRT PYLLRMPPAY NRSPGSVTRS RVDALTVATG
RTVGVEVSVD APPRGTRAAQ LVFEAGGVRL TYPVSANGPV TARLTLDADG LDTDDVTEVV
IYGGGETRGS PAGAATGRPA GRAKITKVPG VGTSVTVSLL AIPASGGRAR TVDRQVIPVL
VEDPSGETPG DVAVLAPSAH VDETGQGPRP AVRIALS
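
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI amino-acid string, which assigns the same amino acids in translation table 11 as in the standard code. Table 11 differs in its permitted start codons, which this sketch does not handle specially because the gene here begins with ATG. The name `translate` and its `to_stop` parameter are illustrative choices, not a database API.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq, to_stop=True):
    """Translate a DNA sequence codon by codon, ignoring whitespace.

    Unknown codons become 'X'; a trailing partial codon is dropped.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGGGCCAC CAGCCGAAGG G"))  # MGPPAEG
```

The output agrees with the first seven residues of the listed protein sequence.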