Gene Franean1_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1549 
Symbol 
ID5669952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1851020 
End bp1852417 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content73% 
IMG OID641240468 
Producthypothetical protein 
Protein accessionYP_001505894 
Protein GI158313386 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.400172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.819759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCACCGG ACCTGGTCGA ACAGCTGCTC ACCCGCCTCG CGGACGGCTG CCCGCCGATG 
CCCGGCGCCA CGGACCCGCC GGCACCGTCC AGGCCCTCGG GGGACGAGGT GGCGGGCCAG
GGCACGCTCC TGAGCCACAA GGACCTCTGG CGCGAGCTCC TCGACGCCGC GGACCGCGAG
CCCATCGAAT CCTGGATGAC CTGGCTGCAC CCGAAGCAGG TCCGGCTGGC CGGACGGCAG
TGGTCGGGGC CGGCCCGCGT GCGCGGCGCG GCCGACACCG GCAAGACGGT CGTCGCACTG
CACCGCGCCA AGTACCTGGC CGCGCGCGGG GAACGGGTGC TGTTCACCAC GCTGGTGCGG
ACCCTCGGCC CGGTCTACCG CGCGCTGCTC GCCCGGATGG CCCCCGACCA GGTCGACCGG
GTCGAGTTCG CCACCGTGCA CGCCGTCGCC GCCCGCTGCC TGCGCGAGCA CGGCCTGACC
GACTTCGCGC AGTACGCGAA GCTGGCCCGG GTCGGGCGCA GCACCCCTTT ACAGCCGACC
CACCGCCGCG CGGTGTGGGA GCTCCACGAG CGGTACGAGC AGCTCCGGGT GGAGCGCGGC
GTCCTCGACC GCGAGCGGAT TCTGCGCTAC GCCCAGGCGG TGCTCGCCGA CGACAGCTTC
GAGGATCTCG ACGGCGTTCG GGAGCAGGGG CACCGCGAGG TCGACGTCGA GCGCCCCGGC
GGCGAGATCC ACGAGGTCAC CGTGTCCGGC GAGGCGGCAC AGGACACCGC GCTCTGCGAC
CATCTCGTCG AGCTCCGGCA GCGCCGGAAC GTGCGTTACG GCGACATGGC GGTGCTTGTG
CCGACGAACG AATCGGAGCG GCGATGGCTG CGGGTGCTCG CCGAACGGGG AATTCCCGCG
GTCTCCCTCA TGCAGTACGA CGGGTCCACC TGCGAGGCGG TCAAGGTCGG GACGTACTTC
CGCGCCAAAA GCCTCGATTT CGCCCACGTC TGCATTCCCG ACCGTAATCT CTTCCCGCGG
CCGCAGCAGC CGTCCGAGTC GGCCGACGCG TTCGGTGAAC GCGTCCAGCT GGAGCGGCGG
CAGTTGTACG TCGCCATCAC GAGGGCTCGG GACAGTGTGT GGGCCGGCAT TCACGCCCGG
CCCTGCCCGG AACACCAGCC GCCCGGCATC GGTCGGATGC CGGTCCGAGC CGGGACGAAC
GGTGCGCAGA CGGTCTACTC GGACGGGACC GGTCCTGGAC GTGACCGCGG CACGGTCGCC
AGCGGCACGG TGGCCGACGG GTCGGCAAAT GACGGACCGC CGAATGCCGG GAAGTCGTGG
GCGGGGACGC CGTGTTCCGG GCGGTGGGGT TCCGGGGTGG CGGGTCGCTG GATGTCGCCG
CGGGTGACCG CGAGGTAG
 
Protein sequence
MAPDLVEQLL TRLADGCPPM PGATDPPAPS RPSGDEVAGQ GTLLSHKDLW RELLDAADRE 
PIESWMTWLH PKQVRLAGRQ WSGPARVRGA ADTGKTVVAL HRAKYLAARG ERVLFTTLVR
TLGPVYRALL ARMAPDQVDR VEFATVHAVA ARCLREHGLT DFAQYAKLAR VGRSTPLQPT
HRRAVWELHE RYEQLRVERG VLDRERILRY AQAVLADDSF EDLDGVREQG HREVDVERPG
GEIHEVTVSG EAAQDTALCD HLVELRQRRN VRYGDMAVLV PTNESERRWL RVLAERGIPA
VSLMQYDGST CEAVKVGTYF RAKSLDFAHV CIPDRNLFPR PQQPSESADA FGERVQLERR
QLYVAITRAR DSVWAGIHAR PCPEHQPPGI GRMPVRAGTN GAQTVYSDGT GPGRDRGTVA
SGTVADGSAN DGPPNAGKSW AGTPCSGRWG SGVAGRWMSP RVTAR