Gene Franean1_2668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2668 
Symbol 
ID5671061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3155971 
End bp3158907 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content71% 
IMG OID641241583 
Producthypothetical protein 
Protein accessionYP_001507003 
Protein GI158314495 
COG category 
COG ID 
TIGRFAM ID[TIGR03607] patatin-related protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGTGA AGCGCCGTGG CGGCCCTCGC CAGCCTGGGG CCGAGGATCT TGAGGACGTT 
CGGATCGCGG TCGTGCTCAA CGGCGGCGTG AGCCTCGCCG TGTGGATGGC CGGTGTCGTC
AACGAGATCA ACACGCTGGC GCAGGCCCGC TCCGCCACAC CGCTCTCGGC GGCGCCTCCG
GCCTGGGCCG CCGGCACGGT GTACGGCTGT CTCCTCGACC TCGTTCACGC GACGGCACGG
GCGGACGTGA TCTCCGGCAC CTCGGCGGGC GGCATAAACG GCGCGTTCCT GGCCTACGCC
CAGGCATACG GCGCCGATCT GGCCGGCCTG GGTGCCAAGT GGGCCGAGCT CGGTTCCTTC
GAGGAGCTGC TGCGCGACTC GACGGGACCG GACCAGCAGT CCTTTCTCCT CGGCGACGAC
TACTTTCTGC CCGAGCTGGC ACGCGCGTTC ACCGAACTGG TGCCGGAGGA CAGCCGGGAG
CAGTCGTACG TGCCGCCCGC CGAGCGGCCG GTCGACCTCG TCATCAACAC GACGCTGATG
CACGGCCGGT CGACGCGGCA CGTCGACGAC TTCGGCACGG AATTCGAGGA ATCCAGCCAC
AGCGGCGAGC TTCGATTCAC CCGCACGGCG GACACTCCGC CGTCGGACGA TGTGTTCAGG
GGCGCCGCCA TCGCGCGTCA GCTGGCGCTG GCCAGCCGGA GCACGGCGTC GTTTCCCGTC
GCCTTCGAAC CGAGTTTCCT GCCGGTCGGG GAGGACGACG GGGACGCCTA CCACCCGGAC
ATGGGCGCCG CCGTACAGCG GCCCACGGCC GCGCCGGGCC CCCGGAGCCG GTACGTCCTC
GATGGCGGCA TCCTGCTGAA CCAACCGGTG CGCCCCGCGC TGGCGGCGAT CTACGCCCAG
CCGGCAGACC GGCAGGTTCG GCGGCTGCTG ACGCACGTCA ACCCGGACCC GCGCTCCGCC
GTCCCCGGCG AGATCGCGGG CCTTCTCGGC CGGGGCCGAG CCGGCGACGA GGGGCTCGAC
AGGGCCGGGG GGCACGACAG CGCTGGCGGG CATGACAGCA CCGAGGTGCC CGACAGCGCC
GGCAGGCCGC CGACCCTCGC TGCCGTGCTG GCAACGCTCG CCGTCCTGCC GTATGCGCAG
TCAGTCGAGG TCGAGCTCCA GCAGCTCCAG GAAACAAACG ATCGCGTCCG GCGACACAGA
TCGGGACGCG CGCACCTGAT CGGTTCCCTG GATGAGAAGC TCGCCGGCCA ACTCATGGAC
CAGTACCGGA CGATGCGGTC CTGCCAGGAG GCCGACCGCA TCGGGGCCGT GTTCGCGGCG
GCCCTCGACC GACAAAGGCT GTGGTGGACG CGCGACGAAC TCGCGGGGGC CCTGCTGGAT
GTGGACCTCG ACGGGATCCC CTGCGTTCCG CGCGCCCTGC CCCGTCGGAA CGCCGATCCC
AAGCGGTGGG CCTGGGGAAT CGGGCCGGTT GAACGCATCG GTGTGCTCGC CGTCGACCTG
TTCAAACGCG CGATGTGGCT TGCCCCGCCA GCGGATCACG CGCTTCGCGG CCGGTTGCGT
GGATACCGCG GGCAGCTGCA CGGCGAGCTG GCGACGCTGC GCGCGATCCG CCGCGACGAC
GACGATTTCT GGCGAAGCTG GTCGGCCCGG CCGCCGGCGC AGCCCGGGGC GGACGAGGAC
CCGCGGGATT CGGCGGATGC CGGCGACGGC TCGGACCCGG TCGAACCGGC GGCACCAGGG
GGCTCCGCGG AACGACGGGC GCGGCTGCGT ACCTGGCTGC AGGACGCGCT CGCATCCTGG
CCGCTCCCGC CCGGGGCGGC GGACATCCCC CGGGACACGT TGCCGGTACG GCTGGGCACG
GTGGCGGAGC GGATAGTTCG CCTGCTGATC GATGCGGCGG ACGACCTCTG GCGGCTCGGG
GCGCCGGACG CGGGTGACGC GGTAGCCACC GTTCCTCACG TGGCCGTTGA GACCGACCTG
CTGAAGAAAC TGCTGCGCGC GCTCCTGTCG GATGACCTCC GGAGTTCCCC GGCAGCACTG
CGGCGGCGGT TGTTGGCGGT GGAGGTCTGT CAACTCGCGA TGGCCGGCGC ACCGCGGGAC
GCGGAGCAGG AGGTGATCTT CCAGCAGATC AGCGGCTTCA CCCCGAACTC GTTCGGCGGA
CCGGCGACGC CGGACAAGAT CGCCGGCATC AGGCTGTTCT GGTTCGGCGG TTTCCTGAAA
AGATCATGGC GGGTCAACGA CTGGATCTGG GGCCGGTTGG ACGGCGCGAC GCGCATGGTT
CAGGCGGTCC TCGACCCGGC GAGGCTGCGC CAGCTCGGCC GCTCCGCTCA GGAGACCCAC
GACGCGCTGC ATCGGATCGC TGTCGGTGGG GCCTACCGGG ACGACCTCTC CGGGTATTTC
CGCGACGCGC GGGACGAGAT CTTCGCCGAA CTCGCCTTCC TGGACTCGGA GGATCCCGGC
GCGCAGGCAG GCTCCCTGAC GGCGACGACG CGCGCGGTCA CGCGGCGGCT CCACGCCGAC
ATCCTCGCCG AGGAGATGCC GCACCTCGTG GAGGCGCTGC GGCACGACAT CGCCAAGGCC
TCCGGCGGGA GGCCCGTCGG CTGTCGCTTC CTGGAGAAGC AGGCCGCCAC GCTCCGGCCG
GCGGCCCCCC AGCCGTCGCT GGACGACCTG TTCGCCCAGT TTCTGGAGGC GAACGCGGAG
ATGATCGGTG AGCCTCTCAG GCACGCCGTT CCGCGGGGTC TCCTGGGACG CTCGACAACA
CTGGCCCTCG GGCTCATCGG GGAGCTGGCG AAGATCTCGT TGTCCAACGC CGTGGAACTG
GTCGTCGAGG AACTCCCGCT CGGGCTCAGG TGGCCCGCTC ACGTGGTGAC CGCGCCGGGC
CGGTGGGCCA TCCGAAAGAT CTCGGGGGTG TTCGAACCTT GGGTGCCATC TCCGTGA
 
Protein sequence
MKVKRRGGPR QPGAEDLEDV RIAVVLNGGV SLAVWMAGVV NEINTLAQAR SATPLSAAPP 
AWAAGTVYGC LLDLVHATAR ADVISGTSAG GINGAFLAYA QAYGADLAGL GAKWAELGSF
EELLRDSTGP DQQSFLLGDD YFLPELARAF TELVPEDSRE QSYVPPAERP VDLVINTTLM
HGRSTRHVDD FGTEFEESSH SGELRFTRTA DTPPSDDVFR GAAIARQLAL ASRSTASFPV
AFEPSFLPVG EDDGDAYHPD MGAAVQRPTA APGPRSRYVL DGGILLNQPV RPALAAIYAQ
PADRQVRRLL THVNPDPRSA VPGEIAGLLG RGRAGDEGLD RAGGHDSAGG HDSTEVPDSA
GRPPTLAAVL ATLAVLPYAQ SVEVELQQLQ ETNDRVRRHR SGRAHLIGSL DEKLAGQLMD
QYRTMRSCQE ADRIGAVFAA ALDRQRLWWT RDELAGALLD VDLDGIPCVP RALPRRNADP
KRWAWGIGPV ERIGVLAVDL FKRAMWLAPP ADHALRGRLR GYRGQLHGEL ATLRAIRRDD
DDFWRSWSAR PPAQPGADED PRDSADAGDG SDPVEPAAPG GSAERRARLR TWLQDALASW
PLPPGAADIP RDTLPVRLGT VAERIVRLLI DAADDLWRLG APDAGDAVAT VPHVAVETDL
LKKLLRALLS DDLRSSPAAL RRRLLAVEVC QLAMAGAPRD AEQEVIFQQI SGFTPNSFGG
PATPDKIAGI RLFWFGGFLK RSWRVNDWIW GRLDGATRMV QAVLDPARLR QLGRSAQETH
DALHRIAVGG AYRDDLSGYF RDARDEIFAE LAFLDSEDPG AQAGSLTATT RAVTRRLHAD
ILAEEMPHLV EALRHDIAKA SGGRPVGCRF LEKQAATLRP AAPQPSLDDL FAQFLEANAE
MIGEPLRHAV PRGLLGRSTT LALGLIGELA KISLSNAVEL VVEELPLGLR WPAHVVTAPG
RWAIRKISGV FEPWVPSP