Gene Franean1_5539 details

Gene Information

Locus tag: Franean1_5539
Symbol:
ID: 5673869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6707141
End bp: 6708862
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 72%
IMG OID: 641244395
Product: replication initiator protein
Protein accession: YP_001509799
Protein GI: 158317291
COG category:
COG ID:
TIGRFAM ID:
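
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; both are assumptions, though they are consistent with the numbers in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 6707141
END_BP = 6708862
GENE_LENGTH_BP = 1722
PROTEIN_LENGTH_AA = 573


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # coordinates vs. gene length
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # gene length vs. protein length
```

Under these assumptions both checks agree with the record: 6708862 - 6707141 + 1 = 1722 bp, and 1722 / 3 - 1 = 573 aa.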


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.95807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCTCC CCTCCCTCAC CGGCCCTGAC CCGGCCGTCC CCCACGACCC CACCATCCGT 
GACGATCGTG ACGATCGGCC CGGCTCGCGG ATGGCACGGA TGCGAATGCC GCTCGCCCGT
GCCGTCGTCG AAGCGGTCGC CGTGGAGAAC GGGGTGTGCG TTCGACCCAT GGCCATGCGC
CGCACGAACC TCGACACCGG CGAAACCGAG ATCATCCCCG TGCCCTGCGG CGCGACCCTC
GCCAGCAAGT GCCCGCCGTG CGCGGAGAAG GCCCGGCGGC TGCGGATGGC GCAGTGCAAA
GCGGGCTGGC ATCTCGATGA CGAACCCCTT CCCGACCCAG ACCCGCCGTC CGATGACGCC
AAGGTCCTCG CCGGGTTCCG CGCGGACCTG GAAGTCGCCC GGCGCGACGC GGAACGGGAC
AGTGACCCGG CCGGCGTCGC CGAGATCGAC ACGCTTATCG ACCAGGTGGA CGACGAACTC
AACGCCCTGG GCGTCCGCGG CAAAGCCGCT CCCGACAACC GGGACCGGCC ACGTCGTGCC
CGCTCGACCC GCCGGCGGCA GGACGCTGCC GATCTGCCGC GGCTGCCCGT GGAAAAGCGC
ACCGTCGGTC GGACGTACGA AGCGGCGGAC GGCACGACCT GGCGGCCGTC GATGTTCCTG
ACCCTCACCT GCGACACGTA CGGCCGGGTG ACCTCCGACG GCACTCCCGT CGATCCGGGC
TCGTACGACT ACCGGCGGGC GGCCCGCGAC GCGATCCACT TCCCGAAGCT GATCGACCGG
TTCTGGCAGA ACCTGCGTAG GGCCGTGGGC TGGGATGTGC AGTACTTCGC CGCGCTCGAA
CCCCAACGGC GGCTCGCCCC GCACCTGCAC GCGGCCGTCC GCGGAACCGT GCCGCGGGCC
CTGCTTCGGC AGGTGGCGGC AGCGACGTAT CACCAGGTGT GGTGGCCACC CTGCGACCAG
GCGGTCTACC CGGACACGGC CCTCCCGACC TGGGCGAACG ACACGAGTGG GTACGTCGAT
CCGGTTTCCG GCCGGTCGCT GCCGACCTGG GACGCGGCCC TTGACGCCGT CGGAGACGAG
GACGAACCGT CCCACGTGGT GCGCTTCGGC CCTCAACTGC AAGCGGACGG CTTCACCGCG
AACTCGGCGC ACACCGGCCG CATGATCGGC TACCTGTGCA AGTACCTGAC CAAGAGCCTC
GACGCCTGCC ACGCGGCCAC CACCGACCGG CAACGGCGGC ACGTCGACCG GCTCGCCGAA
GCCCTGCGCT ACGAACCTTG CGCCCCCACC TGCGCCAACT GGCTCCGCTA CGGGATCCAG
CCGAAGAACC CGAAACCGGG TCTCGTCCCG GGCCGCTGCC GCGGCAAGGC GCACCGGCGG
GAGACCCTCG GCTTCGGCGG ACGGCGGGTG CTGGTCAGTC GCAAGTGGTC CGGCAAGACA
CTGACCGACC ACAAGCATGA TCGCGTCGCG TTCATCCGGG CACAGCTCGA AGCCCTCGGC
CACACCGCCA CCGGCCCGGC CGCGGCGACC GACACCGACC CGGCGCGCAC CGTCTGGACG
CTGCTCCGGC CCGGCGACCC GGCCGCACCG CGGCGTGAAT ACCTGCTGTT GCAGGCGGTC
GCGCAACGCC ACGCCTGGCG CGCACAGCTC GACGCGGCCC GGCGCACCGC ACCCGACGAA
CTTCCGGCAA TCGGCCCACC GGGCAGGGCA CAAGCCGCCT GA
 
Protein sequence
MTLPSLTGPD PAVPHDPTIR DDRDDRPGSR MARMRMPLAR AVVEAVAVEN GVCVRPMAMR 
RTNLDTGETE IIPVPCGATL ASKCPPCAEK ARRLRMAQCK AGWHLDDEPL PDPDPPSDDA
KVLAGFRADL EVARRDAERD SDPAGVAEID TLIDQVDDEL NALGVRGKAA PDNRDRPRRA
RSTRRRQDAA DLPRLPVEKR TVGRTYEAAD GTTWRPSMFL TLTCDTYGRV TSDGTPVDPG
SYDYRRAARD AIHFPKLIDR FWQNLRRAVG WDVQYFAALE PQRRLAPHLH AAVRGTVPRA
LLRQVAAATY HQVWWPPCDQ AVYPDTALPT WANDTSGYVD PVSGRSLPTW DAALDAVGDE
DEPSHVVRFG PQLQADGFTA NSAHTGRMIG YLCKYLTKSL DACHAATTDR QRRHVDRLAE
ALRYEPCAPT CANWLRYGIQ PKNPKPGLVP GRCRGKAHRR ETLGFGGRRV LVSRKWSGKT
LTDHKHDRVA FIRAQLEALG HTATGPAAAT DTDPARTVWT LLRPGDPAAP RREYLLLQAV
AQRHAWRAQL DAARRTAPDE LPAIGPPGRA QAA
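
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the database's own pipeline: it builds the standard codon table, which translation table 11 shares for amino-acid assignments, and it reads a leading GTG or TTG as methionine, since table 11 allows these as alternative start codons. The names `translate` and `ALT_STARTS` are hypothetical, and `ALT_STARTS` lists only a subset of the table 11 start codons.

```python
# Standard codon table built in TCAG order; table 11 uses the same
# amino-acid assignments and differs mainly in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only these alternative starts are handled here (a subset).
ALT_STARTS = {"GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            protein.append("M")  # alternative start codon read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACCCTCC CCTCCCTCAC CGGCCCTGAC"))  # MTLPSLTGPD
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence through the same function gives a way to check the remaining residues.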