Gene Franean1_6284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6284 
Symbol 
ID5674603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7631856 
End bp7633001 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID641245136 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001510532 
Protein GI158318024 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACA ACGGGGGAGC ACGCCGAGTC GGTCGGCGGT GGCTGTGGAT CTGCGCTGGG 
ATCGCCGCGG CCGGCGCCGT GGCCGCAGTC GTGGGAATCT GGCACCTGCC CGACCGGATG
TACCAGGGCG AGGAGGCACG GGCGGCGCTG CAGGGCGGCC TGCTGACGGC GGCCGCCGCG
CTGACCGCGG TGGCCGGTGG TCTGATCGCG TTGGACGAGA CCCGGCAGGC TAACGCGGAG
ACCCGGCGGG CGAACCGGGA TACCCACGTG CGGGAGTTGT ACGTGGAGGC GGCGAAGCTA
CTCAACGACC CGGACAACCT CGGGGTCCGC CTGGCCGGGA TATACGCCCT GGAACGGATC
GCGGTGGATT CCTCGGTGGA TCAGCGCACG GTCGTGGAGG TGCTCTCCGC GTTCGTGCGA
ACCCGCAGCA CCGACTCCGC GCTACGCCCA CCATCACCCG CCGATAGTCA GGAATCACCA
CTGGTGCGGC CGGCAGCGGA CATCCGTGCC GCCGTCCAGG TCCTGGGTCG CCTCCCGGCC
CTCGATGGTG TCCCACGCTG CGACCTGGAC GGCGCGGACC TCACCGGTCC CGCCGGGCTC
GGGGGCCTCG ATCTTTCTGA GGCCAACCTT CTGGGCGCCC AGCTGGCTGG GGCGGAACTT
ACCTACGCCG TGCTGCACGG AGCGAATCTC ACTGGCGCCC GGCTGGACGG TGCGGACCTC
ACCTCCGCCG TGTTGATAGG AGCGCACCTC GCTGGCGTCA AGATGGACGG GGCGAACCTT
AGCGATGTCC GGCTGGTGGG TGCGGACCTG ACCTTCGCTC AGCTGGGCGG AGCAAACCTC
ACCAACGCCT TCCTCGCCAT GGCCACCATG ACCTATGCCG TGTTGGAGGG AGCGCACCTC
GGCGGCGCCC TGCTGGCCGG AACGAATCTC ACCGGCGCCC GGCTAGTTGG TGCAGATCTC
ACCGGCGCCC AGCTGGTGAA TGCGGATCTC ACCGCTGCCC AGATGGAGGG GACGGATCTC
ACCGGCGCCC GGGGCCTGGC GGCGGAGCAG GTGGCAGCGG CGTCCGGGGA TGCGCGGACG
CGGTTGCCGG ACGGAGTGGA ACGGCCTGCG TCCTGGCCGC CGTACGAGCC GCCTCCGGAG
CAGTAA
 
Protein sequence
MADNGGARRV GRRWLWICAG IAAAGAVAAV VGIWHLPDRM YQGEEARAAL QGGLLTAAAA 
LTAVAGGLIA LDETRQANAE TRRANRDTHV RELYVEAAKL LNDPDNLGVR LAGIYALERI
AVDSSVDQRT VVEVLSAFVR TRSTDSALRP PSPADSQESP LVRPAADIRA AVQVLGRLPA
LDGVPRCDLD GADLTGPAGL GGLDLSEANL LGAQLAGAEL TYAVLHGANL TGARLDGADL
TSAVLIGAHL AGVKMDGANL SDVRLVGADL TFAQLGGANL TNAFLAMATM TYAVLEGAHL
GGALLAGTNL TGARLVGADL TGAQLVNADL TAAQMEGTDL TGARGLAAEQ VAAASGDART
RLPDGVERPA SWPPYEPPPE Q