Gene Franean1_1547 details

Gene Information

Locus tag: Franean1_1547
Symbol:
ID: 5669950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1848044
End bp: 1850014
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 73%
IMG OID: 641240466
Product: hypothetical protein
Protein accession: YP_001505892
Protein GI: 158313384
COG category: [S] Function unknown
COG ID: [COG2898] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
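
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (both are assumptions, though the reported lengths agree with them):

```python
# Values copied from the gene information above.
start_bp = 1848044
end_bp = 1850014

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1971 656
```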


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.736781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.90669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCATCG CGAGCAGATC GGGCCCACTG CCGGGCCCGT TGACGACGTC GGACCGCCGG 
GAGCCAGACG GCCCGGGCGC CCGCGGTAGT GGTCGGCGGG ATGACCGGCG CGCCGACAGA
CCGCGGGCCG CCGACAGCAC TCGGCCGGCC GACGGCCCAC GGATCACCCG CGGGCCGCGG
CGTGCCGACA GCCCGCGGCG TACCGAAGGC CCATGGCGTG CCAGCGGGCC ACGGGTCGCC
GCGCTGGTGA CACTGCTGCT CGGCCTCGTG GACATCGCGG CCGTGTTGAC GCCGGGGTGG
CACTCGCGGC TGGAGGCGCT GCGGGAGCTG CTGCCCGCCG CCGCCTCCCG GCAGGCCGCC
GCGTTCACCG TCGTGGTCGG AATGCTGCTG GTGCTGCTCT CGGCCGGGCT CCGGCGGCGC
AAACGGCGAG CGTGGCGAGC CGTCGTGATA CTCCTCGGCT CGAGCGTGAT CCTGCACCTC
GCCCGGGGGC TGGACTACGA GGAGGCTGCC GGGTCGGCCG CGCTGTGCGT CGCCCTGCTG
CTGGCCCACA GGCAGTTCCA GGCCAAGGGC GATCCGACGA CGCGCTGGCG GGCGGCCGGC
GTGGGGCTCC TGCTCACGGT CGTCTCGATC GGGGTCGGCC TGCTGCTGCT GAACCTGCGC
GGCAGTCGGA TCGCCGGCCC CCATCCACTG TCCGCGGAGC TGGAGCAGAT TGTCCTGGGG
CTGGTCGGCA TTCCGGGCCC GCTGGGGTTC AGCTCCGCCC GCTTCGCCGA CCTGGCCAAC
CGGATGCTGC TGACGATGGG TGTACTGACC ATCGGGTCCA CCGCCTATCT GGCGCTTCGC
CCGCCCGAGC CACGGCCGAG ACTGACCGAC GCGGACGAGG CCCGGGTGCG GGACCTGCTG
GCCGGTCACG GCTGCGCCGA TTCGCTCGGG TATTTCGCGC TGAGATCCGA CAAGTCGGTG
ATCTGGTCGC CGACCGGGAA GTCCTGTGTG GCCTACCGCG TGGTCTCCGG GGTGATGCTC
GCCAGCGGCG ATCCGCTGGG TGACCGGGAG GCGTGGCCAG GCGCCATCAG GGAGTTCCTG
CGCGAGGCGG CCGACCACGC CTGGACGCCC GCGGTGATCG GCTGCTCGGA GGCGGGCGGA
ACCGCCTGGA CCAGGGCCGG CCTGTCCGTT CTCGAGTTCG GCGACGAGGC CGTCGTCGAG
ACGGCCGGTT TCACCCTGGA GGGCCGGACG ATGCGTAACG TCCGGCAGGC CGTCGCCCGA
GTCGAACGCG CCGGCTACAC GGTGGACATC CGACGAGTTC GCGACCTGAC GCCGCAGGAT
GTCGACCGTC TCAAGGCACA GGCCGCGGCC TGGCGGGGCA CCGAGACCGA GCGGGGATTC
TCCATGGCGC TCGGCCGGAT CGGCGGTGCG TCGGACGGCG ACTGCGTCGC CGTGATGGCT
TTCTCCACGG ACCCGGACGG CGCGGACCCG GACGGCCCGA ACCCGGACAG CGCGGGCCCG
GACAGCGCCG AGCCGCGGCT GCGCGCGCTG TTGCACTTCG TGCCGTGGGG ACGGACGGGA
CTTTCACTGG ATGCGATGAT CCGTGACCGG ACGGCGGACA ACGGGCTGAA CGAGTTCCTG
ATAGTCAGTG CCCTGCGTCA GGCCGGCGAC CTCGGGGTCG AGAGGCTGTC CCTCAACTTC
GCGTTCTTCC GGTCCGCGCT CGAACGCGGT GAGCGCCTCG GCGCCGGGCC GGTGATCCGT
CACTGGCGCG GCCTGCTGAT GTTCTTCTCC CGCTGGTTCC AGATCGACAG CCTGTACCGG
TTCAACGCGA AGTTCCAACC TGTGTGGCTG CCCCGCTACG TCTGCTATCC GACGTCCGCG
GAGCTGCCCC GGATCACGCT GGCGATGCTC AGGGCTGAGG CCTTCCTCGT CCGGCCACGC
TGGTGCTCCC GCCTCCCCCG GCCCTCCCGG CCTGCCAGGC GCCCGGGGTG A
 
Protein sequence
MAIASRSGPL PGPLTTSDRR EPDGPGARGS GRRDDRRADR PRAADSTRPA DGPRITRGPR 
RADSPRRTEG PWRASGPRVA ALVTLLLGLV DIAAVLTPGW HSRLEALREL LPAAASRQAA
AFTVVVGMLL VLLSAGLRRR KRRAWRAVVI LLGSSVILHL ARGLDYEEAA GSAALCVALL
LAHRQFQAKG DPTTRWRAAG VGLLLTVVSI GVGLLLLNLR GSRIAGPHPL SAELEQIVLG
LVGIPGPLGF SSARFADLAN RMLLTMGVLT IGSTAYLALR PPEPRPRLTD ADEARVRDLL
AGHGCADSLG YFALRSDKSV IWSPTGKSCV AYRVVSGVML ASGDPLGDRE AWPGAIREFL
REAADHAWTP AVIGCSEAGG TAWTRAGLSV LEFGDEAVVE TAGFTLEGRT MRNVRQAVAR
VERAGYTVDI RRVRDLTPQD VDRLKAQAAA WRGTETERGF SMALGRIGGA SDGDCVAVMA
FSTDPDGADP DGPNPDSAGP DSAEPRLRAL LHFVPWGRTG LSLDAMIRDR TADNGLNEFL
IVSALRQAGD LGVERLSLNF AFFRSALERG ERLGAGPVIR HWRGLLMFFS RWFQIDSLYR
FNAKFQPVWL PRYVCYPTSA ELPRITLAML RAEAFLVRPR WCSRLPRPSR PARRPG
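
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, which assigns codons as the standard genetic code does but accepts alternative start codons that are read as methionine at the initiation position. The sketch below illustrates this in Python. The `translate` helper and the `START_CODONS` set are hypothetical names introduced for illustration, and the set lists only a subset of the start codons permitted by table 11; this is not the database's own pipeline.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, ignoring the whitespace used for layout."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            # Stop codon: translation ends here.
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGCCATCG CGAGCAGATC GGGCCCACTG"))  # MAIASRSGPL
```

Passing the full gene sequence to the same function should allow a comparison against the listed protein sequence, with the terminal TGA treated as the stop codon.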