Gene Franean1_2549 details

Gene Information

Locus tag: Franean1_2549
Symbol:
ID: 5670943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3030879
End bp: 3033377
Gene Length: 2499 bp
Protein Length: 832 aa
Translation table: 11
GC content: 73%
IMG OID: 641241465
Product: hypothetical protein
Protein accession: YP_001506885
Protein GI: 158314377
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
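
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3030879
END_BP = 3033377
GENE_LENGTH_BP = 2499
PROTEIN_LENGTH_AA = 832


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 2499
print(coding_codons(GENE_LENGTH_BP))  # 832
```

Under these assumptions both results agree with the Gene Length and Protein Length fields. The check does not depend on the strand, which is blank in this record.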


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.207629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTGC TCGTCGCGGC GACCGCCACG CCGTCGTCGC CGCGCACCAC CTACCTGACC 
GACCCCGCCG GCTTTCTCCA CCAGATCCTC GGCCAGCTCC AGGGGTGGGC AACGGTCTGG
GGACCCGTCG CGGGCCCGCT GCTCACCCTT ACCGCCGCCG GGCTGCTCAC TCTGCGCCGG
CGGATGCGCC GCCGCTACCA GCAGCGGCTG ACCGCCGGCG CCCGCCTCGT GACCGTGTTG
GCCCCGCCCA CCGTCGACCC GGCGGGCGCG ACCGCGCTGT GGTCGAACCT GCTCGGCCTG
CTCCGCCCCG GCTGGCGGCG CCTGTTCGGC CAGCCGCACC TGCTGTGGGA GTACCAGTTC
ACCGCCGACG GGGTCCGTAT CCAGATCTGG GTGCCCGGCG TCGTCCCCGA TGGGTTCGTT
GAACGCGCCA TCGAGGCGGC CTGGCCTGGC GCTCACACCC GCACCACGCC CGCCCGCGCG
CCGCTGCCCG TCGTCGCCCG GCCGGGTCGG CGGCTGCTCG CTGCCGGCGG CGAACTGCGC
CTGGCCCGCC CCGAAGCACT CCCGATCCGG ATCGACCACG ACGCCGACCC GATCCGCGCC
CTGCTCGGCG CGCCCGGCAG CCTCGCCCGC AACCAACGGG CGGCCGTGCA GATCCTGGCC
CGGCCGGTCA CCGGCCGCCG CGTCGCGAAA TCCCGCCGGG CCGCCCGCCG GCTGCGCTCC
GGCGGCTCCG CGCACCTGAT CGGCGGGCTG CTGGACCTAC TCACTCCCCA CACCGGCCGC
ACCCGGCGGC GCCACCGGAC GGCGACCACC ACGGTGAAGG TCGACCCGCA GACGTCGCTG
GCCCTGTCCG CGGAAGACCG CGCCATCGTC ACCAAGCAGC GAGGGGCCCA GTACGAGGTC
CGCGTCCGCT ACGCCATCGC CGCGATCCTC GACGAGCACA CCGACGACAC CACCGCCGCC
CAGGTGGCGA GCCAGCTACG CGGCCGGGCG CACGCCATCG CGTCGGCGTT CTCCGCCTAC
GGCGAGCACA ACTACTACCG CCGCGTCCGG CTCCGCCGCC CGCTGCCCGT CCTCGCCACC
CGGCAGTTCG GCCGTGGCGA CCTGCTCTCG GTCGCCGAAC TCGGCGCGCT GGCCCACCTG
CCGGTCGACG AGGCGACCCC TGGCCTGCAA CGCGCCGGTG CGAAGGCGGT CGCCCCGCCG
CCCGGCGTCG CCGGCCACGG TCCGAACGTC CGCCCGATTG GGCGCACCGA CGCCGGTCAC
GCGCGTCCGG TTGGTCTGCG GGTCCCGGAC GCCCGTCATC ACCTGCACGT CCTCGGCGCG
ACCGGCGCCG GCAAGTCCGA ACTGCTCGCC CGCATGACCC TCGACGACGT CGCCGCGCAC
CGAGGCGTGG TCAATGTCGA CCCGAAGGGC GACCAGATCA TCGACATTCT CGCCCGCTAC
CCCCTCGACG CCCTCGACCG CCTCGTCCTG TTCGACGCTG AGTCGTCCAG CCGGCCGCCG
TGCCTGAACC CGCTGGATCA GCCCGACCGG GGACGGGCTG TCGACAACCT CGTCTCGATC
TTCAGTCGGG TGTATGCCGA CTCGTGGGGG CCGCGCACCG AGGACATCTT CCGCGCCGGC
CTGCTCACCC TCGCCGCCCA ACCCGGTGTC CCCGTCCTGA CTGACCTACC GAAACTGCTC
ACCGACACCG CCTATAGGCA CCGGGCGCTC GGTGAGATCG ACGACGACAT CCTGGCCGGC
TTCTGGACCT GGTACGAAGC TCTCTCCGAC GCCGCCCGCG GTCACGTCGT CGCCCCGCTC
ATGAACAAAC TCCGGGGCTT TCTGCTGCGG CCGTTCGTGC GGGCCGCGAT CGCCGCCGGC
CCCTCCACCG TCGACATGGA CAACGTGTTG AACGACGGCG GGGTGTGCCT GGTCCGCATC
GCCCAGGACG CCCTCGGTGT CGAGACCGCG GCACTGATGG GCTCCATCGT CGTCTCCGCC
GTCTGGCAGT CCACCACCCG CCGCGCCCGC CTACCCCAGG GGAAAAGGCC CGACGCCAGC
CTGTACTTGG ACGAGGCACA CCATTTCCTC ACGCTTCCGT ACGCGTTGGA AGACATGCTC
GCCGCCGCCC GTGGCTACCG GCTCGGGATC ACCCTGGCGC ACCAGAACCT TGTCCAGCTG
CCTCGGCATC TGGAAGAAAG CATCGGCGCC AACGCCCGCT CGAAGATCTA TTTCACGGTG
AGCCCGGCGG ACGCCAAGCG CCTCGCCCGG CACACCGAGC CCCGGCTCGC CGAGCACGAC
CTGGCCAACC TCGGCGTGTT CCACGCCGCC GCCCGCCTCG TCGTCGGCGG AGAAGAAGCC
CCCGCCTTCA CCCTCGTCAC GGAGAAACTG CCCCCGCCAG TACCCGGCCG CGCCGTCCAG
ATCCGCCGGG CCTTGCGCCG CCGCGCCGCC ACCCCCGCCG CGCCGGCTCC CACCGGTCCG
GGTCCGCGGC CCACCTCCGA CCCGCGTCGC GTCGCCTGA
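
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do this, using only the first and last lines of the sequence as sample input. The helper names `clean_sequence` and `gc_fraction` are illustrative and not part of any library. The stop codon set is the standard one for translation table 11.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGACCTGC TCGTCGCGGC GACCGCCACG CCGTCGTCGC CGCGCACCAC CTACCTGACC"
last_line = "GGTCCGCGGC CCACCTCCGA CCCGCGTCGC GTCGCCTGA"

# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3] == "ATG")         # sequence begins with an ATG start codon
print(tail[-3:] in STOP_CODONS)  # sequence ends with a stop codon
print(gc_fraction(head))         # GC fraction of the first line only
```

Running the same helpers on the full sequence (all lines joined) would give the GC content of the whole gene, which can be compared with the GC content field above.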
 
Protein sequence
MDLLVAATAT PSSPRTTYLT DPAGFLHQIL GQLQGWATVW GPVAGPLLTL TAAGLLTLRR 
RMRRRYQQRL TAGARLVTVL APPTVDPAGA TALWSNLLGL LRPGWRRLFG QPHLLWEYQF
TADGVRIQIW VPGVVPDGFV ERAIEAAWPG AHTRTTPARA PLPVVARPGR RLLAAGGELR
LARPEALPIR IDHDADPIRA LLGAPGSLAR NQRAAVQILA RPVTGRRVAK SRRAARRLRS
GGSAHLIGGL LDLLTPHTGR TRRRHRTATT TVKVDPQTSL ALSAEDRAIV TKQRGAQYEV
RVRYAIAAIL DEHTDDTTAA QVASQLRGRA HAIASAFSAY GEHNYYRRVR LRRPLPVLAT
RQFGRGDLLS VAELGALAHL PVDEATPGLQ RAGAKAVAPP PGVAGHGPNV RPIGRTDAGH
ARPVGLRVPD ARHHLHVLGA TGAGKSELLA RMTLDDVAAH RGVVNVDPKG DQIIDILARY
PLDALDRLVL FDAESSSRPP CLNPLDQPDR GRAVDNLVSI FSRVYADSWG PRTEDIFRAG
LLTLAAQPGV PVLTDLPKLL TDTAYRHRAL GEIDDDILAG FWTWYEALSD AARGHVVAPL
MNKLRGFLLR PFVRAAIAAG PSTVDMDNVL NDGGVCLVRI AQDALGVETA ALMGSIVVSA
VWQSTTRRAR LPQGKRPDAS LYLDEAHHFL TLPYALEDML AAARGYRLGI TLAHQNLVQL
PRHLEESIGA NARSKIYFTV SPADAKRLAR HTEPRLAEHD LANLGVFHAA ARLVVGGEEA
PAFTLVTEKL PPPVPGRAVQ IRRALRRRAA TPAAPAPTGP GPRPTSDPRR VA