Gene Franean1_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4247 
Symbol 
ID5672602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5059027 
End bp5061438 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content72% 
IMG OID641243120 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001508537 
Protein GI158316029 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTGGA GGGTAGCGGC CCTACGACGA CTGGAGCGGC TCCGGACTCT GCGGATCGTC 
TCACGGGGGC GTCCACCGCG GGCACCGGAG TCCAACCAGC CCCGCTTGGC CACGCCCCTC
GCCCTGGTGG CCCTGGGAGC CCTCGTCGCC GCCTCCGGAT TCCGCGGTGC CTTCGAGCTG
ACCGCGCGTG TCGCTGTGCC AGTCGGGCTT GCGGCTGTCC TGCCGCCGTT GATAGCCGCC
GTCATGGCTG TGAGCACCCC GGCCCGGAAG GTTTGGCTGA CGGCCGCGAC GTCCACCGCC
GCGTGGCTGG CCGTCCTCGT CCTGACCGTT TTCGGACCGT CCGGAGCATG GTTTCCGGCC
GGTCTCGTAC CCGATGCGCT GGCTCATGGG TTGGACCGGT TGTTACAGAT CACACTGCCC
GCACCGCCGC GAGCTGATCT GCTGGCGGTG GTGGTGACAC TGATCTGGCT CGCCGCCGCG
TCCGCGTCCA TGCTCGTCGC CGCGGGCACC GGACGGGACA CGCTGACCCC GGTGGCCCCG
ATCGCAATGC TTTTTGTCGC GGCGACCCTG ATCAGCCTCC CCGGCCCTGG TTCACACGTC
TCGACCGCGT CCGTCCTGGT CGCGGTGTTC GCGCTCATCG CAGCCGTCAG CCGACCGTCG
GCCGGGCGGT GGGACCGGGC CGCCGGACCG GCACCGGTGG CGGGTGCGCC GCCGGCGGCC
GGCGCGTCGC CTGCGCAGCG GCGGGCCCAC GGAACTGGGC GGCGGCCGTC AACGAAGGCA
GGAAGCCTGC GGCACCTGGT CGCGGTCCTC ACCATCACCT GCGTTGCGGC TGGCTCGTTC
GGTCTGGCCT CCCTGGTCAC CTTGCTGGAC CCCGATCCGT TCGACGTCCG CGAGTACCGG
TCGCCGCCGG CGCGGACCGT CCAGCAGGTT GACCTACTGG CCCTCGTTTC AGCCTGGCAG
GCCGAGCCCC GGACCCGGCT GTTCACGCTG GAGGCGACCG ATGCGTTGAT CGAGGGCGCT
CCGCCGGACC GAATGCGTCT CGCCGTCCTG GACCGCTACG ACGGCAGGAA CTGGAGCAGC
GCGTCCCGCT ACATCCCGGC GGGCCTCGCT GTTCCAGCTC CGGAGGATGT GGATGGACCG
GTAGCCAGAC ACACGCTGAC AATCGATCGT CTGAGCGGCC CCTACCTGCC GGTCCTTGGT
TGGCCGACCC GGCTCGACGC GTCCGGGCTG CGCGCCGGGC GGCGCACCAG CGCGCCGCTC
GCGGTCGACC TCGAGAACGG TCTCCTTTCC GCGGACGCGC GCCTGGAGCC GGGCCGGACG
GTGGAGATCA CGTCGATACT CGGCGCCGCA CCGACCCTGC GTGAGGCCGG CGAACGGCCG
GTCGACCCGA TTGCCGAACC CGTTCTCGCT CTGCCGGGTA ACGTGGCGCC GCCGGGCGAC
CTGACGGCGA TCGCGCAGAA GGCATCGCGG TTGGCGGCCG TCCCAGGTAA ACGAGCTTCG
GTACTTGCGG ATCTGCTGGC CCAGGGTCGG AAGCTTGATC GGACCGTGAT CTCCGGCAGC
TCACTCGGCG ATGTCGAGGC GTTCCTCGGG GCGAGCCGCG TTGGTGGCGA CGCGCTGTTC
GCGACCGCGT TCGCACTGGC AGCCAACACC ATCGGCCTCC CGACCCGGCT GGTGATCGGG
TTCGACCGGC CCAGTTCGGC ATCTGACGGT GCTGTCCACG CCGGTGACGT CCGTGTCTGG
CCGGAGGTGC GGTTCGCCGA GGTCGGCTGG GTTCCGGTGG ATCTCGGCTC GAGCTCGGCC
ACCGGTACCG GCGAGACGCC GCTACCGTCA GCCGGGACTC CTGATACAGG CGTGCATTCG
TCCACGCCGT CCACGCTCAG TCCGCCCGCG AATGGTGTGC GGGTGGCTCC ACCGCGTACC
CCGCTTCCCC GGCCCGACCG CCCGGTGTGG GCCATGATTA TAGCGGTGAT TGTCGTCGTC
CTCGGCCTTG CCGTTACCGG GGCGTGGTGG GCCACCAGGG CGGAACGACG GCGCCGACGC
GCCCGTTCCG AGGCGTCGCC CCGCCGTCGT CTGATCGAGG CATGGTGGGA CGCGGTCGAG
ACGATGGGTG GTCGGCGCCG TACGGTCCTG TCCTCCGACA CCTGCGCCGA GGTCGTGCGC
GAGGCTCGGG AGGTCTACGG CGAGCGTGCC GCCGAGCCGC TCGCCGAACT CGGCACGGAC
GCCGCGAGGG CGCTCTTTTC CGCGAGTGAT CCGCGACCTG CGGAGGCGGA CCGTGCGTGG
GACCTGAACC AGCGCTTCCG TAAGCGCCTG CGGACCGAGC GCCGCCGTCG GCGCCGTGCC
GCGGTGCGCG CAGCACCACG GCACCTCAGC CGGGCGCTGC GCAGGGCCGG CGCGCAGGCA
CGTCGGCGAT GA
 
Protein sequence
MTWRVAALRR LERLRTLRIV SRGRPPRAPE SNQPRLATPL ALVALGALVA ASGFRGAFEL 
TARVAVPVGL AAVLPPLIAA VMAVSTPARK VWLTAATSTA AWLAVLVLTV FGPSGAWFPA
GLVPDALAHG LDRLLQITLP APPRADLLAV VVTLIWLAAA SASMLVAAGT GRDTLTPVAP
IAMLFVAATL ISLPGPGSHV STASVLVAVF ALIAAVSRPS AGRWDRAAGP APVAGAPPAA
GASPAQRRAH GTGRRPSTKA GSLRHLVAVL TITCVAAGSF GLASLVTLLD PDPFDVREYR
SPPARTVQQV DLLALVSAWQ AEPRTRLFTL EATDALIEGA PPDRMRLAVL DRYDGRNWSS
ASRYIPAGLA VPAPEDVDGP VARHTLTIDR LSGPYLPVLG WPTRLDASGL RAGRRTSAPL
AVDLENGLLS ADARLEPGRT VEITSILGAA PTLREAGERP VDPIAEPVLA LPGNVAPPGD
LTAIAQKASR LAAVPGKRAS VLADLLAQGR KLDRTVISGS SLGDVEAFLG ASRVGGDALF
ATAFALAANT IGLPTRLVIG FDRPSSASDG AVHAGDVRVW PEVRFAEVGW VPVDLGSSSA
TGTGETPLPS AGTPDTGVHS STPSTLSPPA NGVRVAPPRT PLPRPDRPVW AMIIAVIVVV
LGLAVTGAWW ATRAERRRRR ARSEASPRRR LIEAWWDAVE TMGGRRRTVL SSDTCAEVVR
EAREVYGERA AEPLAELGTD AARALFSASD PRPAEADRAW DLNQRFRKRL RTERRRRRRA
AVRAAPRHLS RALRRAGAQA RRR