Gene Franean1_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0521 
Symbol 
ID5668940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp606994 
End bp608250 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content68% 
IMG OID641239450 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001504888 
Protein GI158312380 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTTT CCGCTAAGCG CGACTTTCTG ATTACTGAAA GGAGAGTGGA GTATCCAGTC 
GCTGGGCGGC GGTTGCGGAT CGCTACTGGG GCCGCAGCGT CGGGGGCTGC GCTGGCGACG
GTGGGGATCT TCTACCTGCG CGCCTCGATG CACCCGTCTG GCCTCGGCGG AGAAGCGGAG
ACTCGGGCGA TGGTGGAGTG CGTGTTGCTG ATTGCGTCGG TCGCGCTAGC GACGCTGGCC
GGCTACCGGG TCGCCGCGGA CGGGATGCGA CAAGCGTACG CGCAGGCATG GAAAGCGGGC
ATCCGGGAAA AGGACAGCAA TGCCCACTTT TGTGCCCGGG AGTTCTTCGG CCGCGCGGTC
GACCGGCTCG GCAGCGAGAA CAACACGACT CGCCTTGGCG GGATCTACGC CCTGGAGCGG
ATCGCGATGG ACTCACCGAG TGACCAGCGC GCGGTCGTCG AGGTCCTCTC CGCCTTCATC
CGCACCCGCA GCACGGACCC CACGCTGCGG CCCGCCGTGT CCGGTCCGGT CGTTCCGCTG
CGCCCTGCCG TGGATATCCA CGCCGCGGTG GCGGTCCTGG GGCGTCTGTC CGTCCTCGAC
GGTGTCCCCC GCGCGGACTT GAGCGGTGCG AAGCTGACAG GTCCCGCCGC CCTGCACTGC
ATACAGGCCA GCTATGGCAA CCTCAGCAAC ACCGACCTCA CCGGAGCGGA CCTCAGCCGC
GCCCATCTGG GTCGGGCGGA CCTCACTGCC AGCCGGCTGG GCGGCACGGA TCTCACGGGC
GCCTCGCTGA ACGAGGCCAA CCTCAGCTAT ACCTGGTTGG GCGGAGCGAA CCTGACCCGC
GCCCGGCTAA GCGGAGCGGA TCTCACCGGT GCATCGCTAA GCGGAGCGGA CCTGACCCGC
GCCTGGCTGG ACGGCGCGGA TCTCACGGGC GCATCGCTAG GCGGAGCGAA CCTGACCCGC
GCCTGGCTGA CCGAGGCGGA CCTGACCCGC GCCTGGCTGG GCGGGGCGAA CCTCATCACT
GCGGTGGGAC TGGTCCAGGA TCAGATCGAC GCGGCATACG GCGACGGGTG GACGCGGCTA
CCGCCGGAAC TAACGAGACC GGCTTTGTGG ACCTCGGCCG AGGCTGACGA GTACCGCCCG
GCAGACCCAC ACCAGGTTGT CGGACAGTGG CATCCGGAGG TGCTGGCGGG GCAGGACAAC
GCCCTGTCCT CCCGGGACTA TCACCACACG ATCCTTCCGA AGGTCGTGAT TGTGTGA
 
Protein sequence
MHFSAKRDFL ITERRVEYPV AGRRLRIATG AAASGAALAT VGIFYLRASM HPSGLGGEAE 
TRAMVECVLL IASVALATLA GYRVAADGMR QAYAQAWKAG IREKDSNAHF CAREFFGRAV
DRLGSENNTT RLGGIYALER IAMDSPSDQR AVVEVLSAFI RTRSTDPTLR PAVSGPVVPL
RPAVDIHAAV AVLGRLSVLD GVPRADLSGA KLTGPAALHC IQASYGNLSN TDLTGADLSR
AHLGRADLTA SRLGGTDLTG ASLNEANLSY TWLGGANLTR ARLSGADLTG ASLSGADLTR
AWLDGADLTG ASLGGANLTR AWLTEADLTR AWLGGANLIT AVGLVQDQID AAYGDGWTRL
PPELTRPALW TSAEADEYRP ADPHQVVGQW HPEVLAGQDN ALSSRDYHHT ILPKVVIV