Gene Franean1_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2029 
Symbol 
ID5670430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2439229 
End bp2440773 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content78% 
IMG OID641240950 
Productkelch repeat-containing protein 
Protein accessionYP_001506372 
Protein GI158313864 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.762762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGGT CACGAACGTC CCGCAGATGC GCCTCACGCG CGTTGGTCCT GAGCACCGCG 
CTGGCGGCGG CTCTGGCGAC ACCGGCGCTG CTCGCGCCGG GCGCCGAGGC CCTGCCGACC
GGTACGCCAA GCCCCACAAG CCCGACCAGC ACGTCCCCAC AGCCGTCCGC GGGCGCGGCC
GGCGCCTGCC CGGCGCTGCC CCACATCGCC GGCCCGCTGC GCACCTGGGA GACCGCCGCC
GACCTTCCGG CGCCACGCGC CCACGTCGCG GCCACCACGG GATGCGACGG CCGGATCTAC
CTCCTCGGCG GCGAGGCGAC CGCCGCGGGC GGTGGTGGGG TCGGCGGGGG CGTGCCCGCG
ACGCCGTCCG CGACGCCCAC CACCCGCTCA CCGGCGACGT CGACGCCCAG CGGCTCGGCG
ACCCCCAGCC GGACAGCCAC GCCGACAACC ACGCCGACCG CGACGTCGAC CACAACCGGG
TCACCCGGCG GCGAGGGCGG TTCGACGCCG AGCCCGGGCG TGCGGCTGAC CCCGGCCGCC
TTCAACAACG GGGGTGTGCC GACCAGCACG CCCGGCGAGC TCGCCCCCCT GTCCCCGAAC
GGGTCCGCCA CGGACGAGCC CACGACGACG CCGACGACAC CGACCACGCC AACGGCACCC
ACGACGTCCC GCACACCGAC CGCCACGCCG ACCTCTCCGG CCGGCGGCGG CACGCCGACC
GCCACGCCGA ACGGCGCGAC GGGCGGCGGG ACGACCGGCG GCGCGACTGT GGCGGGCGCC
CCGGTTGACA CCGTCCAGGT CTACGACCCG AAGCGGGACG CCTGGTCGAA CGCCCCGGCC
CTGCCGACCG CGCGCGACCA CCTGGCCGCT GCGACCGACA CCGACGGCCG GATCTACGCG
ATCGGCGGCC AGACCGGCGG CGGCACCCCG ACCGACACCG TCGAGGTGTA CACGCCGTCG
AGCGGTGACT GGACGGAGGG CCCGTCGCTG CCGCGCCCGA TGGGCACGCC GAGCGCCACC
CGCGGCACCG ACGGGAAGAT CTACGTCCTC GACGCGACGA CCCTGGCGGT CTACGACCCC
GACTCCGGTG ACTGGACGAC GGGCGGGGCG CCGCCGTCCG GTGCCGGCGC ACCGGTGCTG
GTCGGCCTGC CGGACGGGCG GATTCTCGCC GCGGGCGGCA GCGACGGCGG CTCCGGTTCC
GAGGCGTCGT CCGACGCCTA CGCCTACACG CCGGGCTCGG GTTCGGCGGA CGGCTCCTGG
GCGAAGGTGG CCGACCTGCC GACGGGTGTC TCCGAGGCCG CGGGCGCGAC CGGGCCGGAC
GGCCGGGTCT ACATCGTCGG CGGGCGGGAC AGCGACGGCG ACACCGTCGG CTCCACCCAG
GTGTTCACCC CGGACGACGA CCGCTGGTCC GCGGGTGCGA GCAGCGCCCT CACCGCGCGC
TCCGGCCACG GCGCGGCCAC CGGTGGGGAC GGCCGCGTCT ACGTGGCGGG CGGGACGTCC
GGTTCCGGCG CCCCGCTCGA CTCGGCTGCG GCCCTGGGCG AGTAG
 
Protein sequence
MSWSRTSRRC ASRALVLSTA LAAALATPAL LAPGAEALPT GTPSPTSPTS TSPQPSAGAA 
GACPALPHIA GPLRTWETAA DLPAPRAHVA ATTGCDGRIY LLGGEATAAG GGGVGGGVPA
TPSATPTTRS PATSTPSGSA TPSRTATPTT TPTATSTTTG SPGGEGGSTP SPGVRLTPAA
FNNGGVPTST PGELAPLSPN GSATDEPTTT PTTPTTPTAP TTSRTPTATP TSPAGGGTPT
ATPNGATGGG TTGGATVAGA PVDTVQVYDP KRDAWSNAPA LPTARDHLAA ATDTDGRIYA
IGGQTGGGTP TDTVEVYTPS SGDWTEGPSL PRPMGTPSAT RGTDGKIYVL DATTLAVYDP
DSGDWTTGGA PPSGAGAPVL VGLPDGRILA AGGSDGGSGS EASSDAYAYT PGSGSADGSW
AKVADLPTGV SEAAGATGPD GRVYIVGGRD SDGDTVGSTQ VFTPDDDRWS AGASSALTAR
SGHGAATGGD GRVYVAGGTS GSGAPLDSAA ALGE