Gene Franean1_6403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6403 
Symbol 
ID5674718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7772815 
End bp7775067 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content75% 
IMG OID641245251 
Producthypothetical protein 
Protein accessionYP_001510646 
Protein GI158318138 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.187686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCG ACTCCCCGGT CGGCGCGTCG ACCGGAGGTG GTCCGACCAC GGCGGACAGC 
ACCGCCGGGA CGCTCCCGGG TGCACCCGTC GACCTGGCCG ACGGCACCGG TCACGCGGAG
CCGTCCGGAG CGGGCACGCA CGCAGGCCCA CGAGCCCGGC GGTTCCGGTG GTTCCGGCAA
AGCCGGCACT TCCTGCGGGT CCAGTCGTTC CGCTTCCGGT CGCCGGCCGG CGTCGTCGCT
GTGGTCGCGC TCCTGTCCTT CGTCGCGAGC CTGGTCCTGC GGGCCACGCT GTACCCGGAC
GGCTCCGGCG ACGCGGACGA GGCCGCCTAC ATCCTCCAGG CGCGGATGCT GCTCGAGGGG
CGGCTGACCC TGGACGCGGG CGCCGTCGAA CCGTTCTTCC GGCCGTGGCT GACGGGTGTG
CACGACGGTC ACGTGTTCAC CAAGTACCTG CCCGGCTGGC CGGCGTTGCT CGCGCTGTCC
CAGGTGCTGT TCGACACGAT GGCCGTGGCG CCCGCCGCCG TCGCCGGGGT CTGGGTCGTC
GGGACCTACC GGCTTTCCCG CGAGCTCTTC GACCATGCCT GGAGCGCGGT GGCCGCCGCT
GTGGGCGTCG CGCTCTCACC GCTGGTCCTC CTGCATACGG CACTGCCACT CGCCTACGCC
CCGGGGGCGG CCGTGCTCGT CCTGGCCAGC GCGGAGCTGT TGCGCGGTGC ACGGACGGGG
GCGCGGGCCG CCCTGGCCGG CGGCGGCGCG GGCCTGGGGC TGGTGCTGCT GATCCGGCCG
TTCGACGTCG TGCTCGTGCT CCTCCCGCTG GCCGCGCTCG CGGCCGCGCG GCGCCGGCGG
GAGCTCGGGA CGCTGCTGCG GCGCTCGGGC TGGGCGGTGC TCGGCGCACT GCCCCTGGTC
ACCGCCCTGC TCGCCTACTG CTGGCGGGTC ACCGGATCGC CCCTGCGTAT GCCGCTGTCG
GCCTCGGACC CACTGGACCG GTTCGGTTTC GGCCCGCGCC GGATCCTGCC GTCCGAGTCG
AGTTTCCTGT TCACCCGCCG GCTGGCGCTC GACGCCCTCC AGGAGACGCT CGAGGTGGCG
CCGAGCTGGT TCTTCGGCGG TGCCGCGCTG ATCGCGCTGG CCGCCGTCGG CCTGGTGGCG
CCGCGGCGGC GACTCGAGCG CCTGTTCCTG CTCGCGACGA CGGGCGTGGT GCTGGCCGGC
TACACGTTCT GGTGGGGGTC GGCGTTCGCC ATGCCGGGCC TGCGCAACGG CCTCGGCCCG
CACTACCATC TGGCCGCGTT CACCCCGGTC GTCATCCTCG CGGCCGACGG CGCCCGATGG
CTGTGGACGT TCCTCCCGGC ACAGTTGCCG CTGTTCCGCC GCCCGGGGGC ATCCGTGCCG
GGTACCGCGC GCGGCCTGGC CTGGCGCATG GTCCGTCCGG GGGCTGTAGC GCTCGCCGTC
GCCGGGCTCG TCGCGATCAC GGTGCCGACC CTGCAGCCCC GGATCGACGT GCAGCGCGGG
GTCAACGAGG GCAACGACTT CCTGGCCGCG CTGCTCCCGG ACAACCTGGG CGGGCCGGCC
GTGGTGCTGG TGACGCCGAC AGTCCCGAGC CGCTACACGC AGGTTCCCTA CCATTCGCTG
CGGAACTCCC CCGACCTCGA CGGACCTGTC GTCTTCGCGG CGGACATCGG GCCGGGCTCG
GCCGCGCTGC CTGACCGGAT GCCGGACCGG GCGATGTTCC GCCTGCGGCC GGACGAGATC
GCAGACCCGG CGGTCCCGGG CAGCTTCCGG GGGTCCTTCG TGCCGCTGAG GCAGGTCACC
GGGAGCCGCG TCGAGATCCG TGTACAGGTA CGGATTCCCG GTGACGCTGG GACGGCACCC
GTGCGGTCGG GGGATGCGCG GCTGTACGTC CGCCTCGGCG GGGAGGTCCG CACCCTGCGA
ACAGCCGTCC CGGTGACGAT CACGCACACG TTCGTGCTCA CCACCGGTCC CGGCACCGGC
CCGGACGAGA TCGGGACCGC CGGCGCGTCG CTGCCCGCCG AGCTCGTCGT CGGGTTCACC
GACGGCACCG GCCCGGCGAG CGGGGCCTGG GAGGAGCGGT TTCCCCTGGT ACGCCGCCCC
GGTGGCGACC TCTCCCTGCT CGCTCCCGGG CTGGGCTGGC GCCGACTGTC CAAGGTGGCT
GGCAGCGGCG CGGCCGGCGG CGACGGCCAG TGGCTCCCGG CCACCGCGAA ACCCACCTTG
GACGTCTCGC TGACCGGCGC CGCTGCCCGC TGA
 
Protein sequence
MTVDSPVGAS TGGGPTTADS TAGTLPGAPV DLADGTGHAE PSGAGTHAGP RARRFRWFRQ 
SRHFLRVQSF RFRSPAGVVA VVALLSFVAS LVLRATLYPD GSGDADEAAY ILQARMLLEG
RLTLDAGAVE PFFRPWLTGV HDGHVFTKYL PGWPALLALS QVLFDTMAVA PAAVAGVWVV
GTYRLSRELF DHAWSAVAAA VGVALSPLVL LHTALPLAYA PGAAVLVLAS AELLRGARTG
ARAALAGGGA GLGLVLLIRP FDVVLVLLPL AALAAARRRR ELGTLLRRSG WAVLGALPLV
TALLAYCWRV TGSPLRMPLS ASDPLDRFGF GPRRILPSES SFLFTRRLAL DALQETLEVA
PSWFFGGAAL IALAAVGLVA PRRRLERLFL LATTGVVLAG YTFWWGSAFA MPGLRNGLGP
HYHLAAFTPV VILAADGARW LWTFLPAQLP LFRRPGASVP GTARGLAWRM VRPGAVALAV
AGLVAITVPT LQPRIDVQRG VNEGNDFLAA LLPDNLGGPA VVLVTPTVPS RYTQVPYHSL
RNSPDLDGPV VFAADIGPGS AALPDRMPDR AMFRLRPDEI ADPAVPGSFR GSFVPLRQVT
GSRVEIRVQV RIPGDAGTAP VRSGDARLYV RLGGEVRTLR TAVPVTITHT FVLTTGPGTG
PDEIGTAGAS LPAELVVGFT DGTGPASGAW EERFPLVRRP GGDLSLLAPG LGWRRLSKVA
GSGAAGGDGQ WLPATAKPTL DVSLTGAAAR