Gene Franean1_4278 details

Gene Information

Locus tag: Franean1_4278
Symbol:
ID: 5672633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5116082
End bp: 5117317
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 78%
IMG OID: 641243151
Product: pentapeptide repeat-containing protein
Protein accession: YP_001508568
Protein GI: 158316060
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
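
The coordinate and length fields above are consistent with each other. The sketch below checks that, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 5116082
end_bp = 5117317
gene_length_bp = 1236
protein_length_aa = 411


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```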


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0535647
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.226875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA ACGCCGGTGT GGCCCCCGGC GCTCATGGAA CCTGCTCCTG CGGCGGCGCG 
GCCGGCGTCG CCACCAGCGG CGCCGCGCGA CGGTCCGCGG GCCGGCGCTG GCTGTGGATC
GTCGCCGGGC TGGCCGCGGC CGGCGCGGCG GCGGCCGCGG TCGGGATCTG GCATCTCCCC
CCGCGGATGT ACCCGGACCC AGGCGACACC GACGCGCGGG CGGCCCTGCA GGGCGGCCTG
CTGACCGCGG CCTCGGCCCT CATCGCCGTG GCCGGCGCCC TGGTCGCCCT GGACGAGACC
CGGGTGGCCA ACACCGAGAC CCGGCGGGCG AACGAGGCGG CCGACGAACG CGAGCGGCAG
GCCTACGCGA ACACCCACGT CCGCGAGCTC TACACCCGGG CGATCGACCA GCTCGGCTCG
GACAGCGACA CGATCCGCCT GGGCGGCATC TACGCCCTCG AACGGATCGT CGCCGACAGC
CCCGCCGACC GGCGGGCCGT CGTCGAGGTC CTCGCCGCCT TCGTCCGCAC CCTCAGCACC
GATCCCCGGC GCGCCCCGGC ACCCGCCGCA CCCGCCGCGC CGTCCGCCAA GCCCGGGCGG
CGCGGGCCGT CCCGGCCGCC CGCCGTCGAC ATCCGCGCCG CCGTCGGCGT CCTCGCCCGG
CTCCCGCACC CCGCGGACCT CACCGGCACC AACCTGACCG GGCTCACCGG CCTCACCGGC
CACGCGGATC TTCCCGGTGC CCCCAGCCTC GCCCACCTGA CGCTCACCAA CGCCACCCTG
GCCGACGCCC GGCTGGCCGG GGTCGACTTC ACCGGCGGCA GCCTGGACGA CGTCGATCTC
GCCCGCGCCG ACCTGCGCCG GGCGAACCTC ACCGACGCCG AGCTTGTCGA CGCGGACCTC
ACCGGCGCCC GGCTCGCCGA CGCGACCCTT GCCGGCGCCC TGCTCTTCCG GGCGACCCTC
ACCGGCGCCC AGCTGGGCCG GGCCGATCTC ACCGGCGCCC AGCTCGGCGG CGCCGACCTC
ACGAACGCCG TCCTGGACGA GGCGATCCTC GCCGACGCCG TCCTCTCCGG GGCGAACCTC
ACCAACGCCC GACTGGACGG CGCCGACCTC ACCGCCGCCA CCGGCCTGGC CCAGAAGCAG
GTGGACTCCG CGCGCGGCGA CCGGCGGACC CACCTGCCGG CGGGCCTGGC CCGGCCGGCG
TCATGGGACA CCGCGGAAGG GCCGGCCGGG CAGTAG
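
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration, so the printed percentage is for that line and not for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = "ATGACCGACA ACGCCGGTGT GGCCCCCGGC GCTCATGGAA CCTGCTCCTG CGGCGGCGCG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through the same two functions would give the whole-gene value to compare against the record.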
 
Protein sequence
MTDNAGVAPG AHGTCSCGGA AGVATSGAAR RSAGRRWLWI VAGLAAAGAA AAAVGIWHLP 
PRMYPDPGDT DARAALQGGL LTAASALIAV AGALVALDET RVANTETRRA NEAADERERQ
AYANTHVREL YTRAIDQLGS DSDTIRLGGI YALERIVADS PADRRAVVEV LAAFVRTLST
DPRRAPAPAA PAAPSAKPGR RGPSRPPAVD IRAAVGVLAR LPHPADLTGT NLTGLTGLTG
HADLPGAPSL AHLTLTNATL ADARLAGVDF TGGSLDDVDL ARADLRRANL TDAELVDADL
TGARLADATL AGALLFRATL TGAQLGRADL TGAQLGGADL TNAVLDEAIL ADAVLSGANL
TNARLDGADL TAATGLAQKQ VDSARGDRRT HLPAGLARPA SWDTAEGPAG Q