Gene Franean1_6262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6262 
Symbol 
ID5674581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7607226 
End bp7608452 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID641245114 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001510510 
Protein GI158318002 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.857591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGACA ACGGGGAAGC ACGCCGTCGG GCTGGTCGGC GCGGGCTGTG GATCGTCGCC 
GGGATCGCCG CGGTGGGGGC AGTCTGCGCG GTGGTGGGGA TCTGGCACCT GCCGGACCGG
ATGTACCCGC CAGGAACCGA CGGTGGGGCG GAAGCACGAG CCGCGTTGCA GGGCGGGCTA
CTCACGGCGG CGGCTGCGCT CACCGCCGTG GCTGGCGGTC TGATTGCCTT GGACGAGACC
CGGCGGGCCA ACGCCGAAGT GCGGCAGGCC AATGCCAACA CCCACGTCCG CGAGCTCTAC
ACGGCCGCGA TAGGGCTGCT GAGCTCGGAT GCGATCGACA GTCGGCTAGG TGGGATCTAC
GCCCTCGAAC GGATTGCGTG GGATAGTCCT GCCGACCAGT CCACTGTCGT CGAGGTCCTC
TCCGCGTTTG TCCGCGAGCA CGCCCGACCC CTCACAGATG CGCCGGCCGG CCTCCCGGCC
GAGATTCGGG GCCGAGGTGG TGGGGGTCTG CGCAGTCGAC GTCGTCGTGG TCACGCCGCT
GGCAGGAGGT CGGAGGTCCG CCACCGGCTA CCGCCCTGGG ATCGATTTAT CCAGATAGGC
CCATGGAGTA ACGAGGCTCC GCCACCCACT GATGTGCAGG CGGCTCTTAC CGTCCTAGGA
CGTCTACCCG ACCTCGGAGG CTTTCGCGCC GACCTCACCG GAGCGAATCT TACCGGTGCC
GAGCTAGAAG GCGCGAATCT CTTTCCCGCA CGGCTGACTA GGGCCACTTT TACCGGATCA
CACCTAGGCA GGGTAAACCT TAAGTACGCC CAGTTGTACT TGACAAATTT CACCGATGCC
ACCATAAATT CCATAAACTT TACCCGCGCA CAAATACAAA ATACAAACTT TACGGGCACG
CTAATGATGG GTGCAGACTT TAGTGAAGCA CTGATCTCCG ACGCAGATTT CACCGACGCC
TTCCTGACTG CAACGGTTCT CACCGACGCT ATAATAAGCG CAAACTTCAC GCGCGCATTT
ATTGTGGAAG TGGATTTCTC CGGATTAAAT ATCGGCGGGA TAAACCTCAC CGATGCTCGA
CTGCAAGCAG TGAATTTCTC CGGCGCTAAA GGTCTTACAC AGGAACAGGT GGATAGCGCC
CAGGGCGACG GGCGGACGCG GTTGCCGGCG GGTCTGGTGC GGCCGGCGTC GTGGGGTCCG
GAGGAGCCGC CAGTCGGGGG CGGCTGA
 
Protein sequence
MVDNGEARRR AGRRGLWIVA GIAAVGAVCA VVGIWHLPDR MYPPGTDGGA EARAALQGGL 
LTAAAALTAV AGGLIALDET RRANAEVRQA NANTHVRELY TAAIGLLSSD AIDSRLGGIY
ALERIAWDSP ADQSTVVEVL SAFVREHARP LTDAPAGLPA EIRGRGGGGL RSRRRRGHAA
GRRSEVRHRL PPWDRFIQIG PWSNEAPPPT DVQAALTVLG RLPDLGGFRA DLTGANLTGA
ELEGANLFPA RLTRATFTGS HLGRVNLKYA QLYLTNFTDA TINSINFTRA QIQNTNFTGT
LMMGADFSEA LISDADFTDA FLTATVLTDA IISANFTRAF IVEVDFSGLN IGGINLTDAR
LQAVNFSGAK GLTQEQVDSA QGDGRTRLPA GLVRPASWGP EEPPVGGG