Gene Franean1_1816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1816 
Symbol 
ID5670218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2179310 
End bp2180650 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content74% 
IMG OID641240737 
Producthypothetical protein 
Protein accessionYP_001506160 
Protein GI158313652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.093427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0652436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGACC AGACCCCTCC TGGCCAGCCC TCCCCCGGTC CGGAGAACCA GGAGACCTGG 
AGGGCCCCCA ACAACTCGTG GCAACAGCCA CCGGATGCCG GCGGCCAGGC GGGCGGAGCG
CCCGCGGCGG CCGGTGCTCC CGGCGCGGGC CAGGGCGGCC TGCCGTCCAC CACCCCGCCG
CCGGCGCCCG TGCCGCAGTG GGGCACCGGC GCGCCGGGCC AGGAAGGCGG CGGATGGCCG
TCCACCACGC CCGGGGGCGG CGGGGCAGGC GGCGGATGGG GCACGCCACC GGCCGGCGGG
CCCAACCCGG ACGGTTCGTG GCCCGGGGCG GGACAGCAGC AGTGGCCCCA GCAGGGCACC
GATACGCCCT GGGGGCAGCC GGGCCAGCCC GGCACCGGAT GGCAGCAGCC CGCCGGCTAC
CAGCAGGGTG GCACCCCGGG CCAGTACCAG CAGGGCTACC AGCAGCCGGC TGACTACCAG
CAGCAGGGGT ACCAGCAGCC CGGGTATCAG CAGCCCGCCG ACTACCAGCA GCAGGGGTAC
CAGCAGCCTG GCTACCAGCA GCCCGCCGAC TACCAGCAGC AGGGGTACCA GCAGCCTGGC
TACCAGCAGG GCTTCCCCCA GCAGCAGGGC TTCCCCCAGC AGCAGGGCAA CTGGCAGCAG
CCCGGCGGCC CGCCACCGGC CCGGCCCCGC CGGAACCCGG CGATGATCAT CATTCCGGTG
GCCGTCGTCG CGGTCATCGT CCTCGGGGTG GTGATCGCCC TCGCGGCCGG CGGCGACGAC
TCGAAGCCGA CGGCCACGCC GCCGGCGGTC ACGAACCTGG GCCCGGGCAC CGTGCCCACC
CTGACGGCAC CCGCCGTCCC GGGTACGACG ACGGCCCCGC AGCAGCCTGC CGGCCCCGCC
GGATGCACGC CGGTCGTGCC GCAGGGCGCG CCGCCCGCGG GCACCCTGAC CCTGGGCGGG
ACGGGCACGG TGGTCGGCAC CGCGAGCTCG TCGGTCAGCG ACTTCGAGGC CAAGGTGACG
CTGAACAGCA TCTGCAGCAC CACCGGCCCG GCCGCCGACT ACTCCGATCC GCCGGTGCAG
GGCGCCAACT ACATCCTGAA CGTGACCGTC GAGACCGTCC GCGGCGAGAC GACCGCGTCA
CCGGACGACT TCTACATCCA GACCTCGGAC GGCAGCCGGT ACGACGGCTC CTTCACCACG
GTCGAGCCGA AGCTGTTCAC CCTCGATCTG AAGGCCGGTC AGAAGGTGCG CGGCAACGTG
GTCATCGACG CCCCGGCGGG TCACCACATC CTGTCCTGGG AGCCGCTGTT CGCGACGCAG
CCGGCGAAGT TCCAGTTCTG A
 
Protein sequence
MTDQTPPGQP SPGPENQETW RAPNNSWQQP PDAGGQAGGA PAAAGAPGAG QGGLPSTTPP 
PAPVPQWGTG APGQEGGGWP STTPGGGGAG GGWGTPPAGG PNPDGSWPGA GQQQWPQQGT
DTPWGQPGQP GTGWQQPAGY QQGGTPGQYQ QGYQQPADYQ QQGYQQPGYQ QPADYQQQGY
QQPGYQQPAD YQQQGYQQPG YQQGFPQQQG FPQQQGNWQQ PGGPPPARPR RNPAMIIIPV
AVVAVIVLGV VIALAAGGDD SKPTATPPAV TNLGPGTVPT LTAPAVPGTT TAPQQPAGPA
GCTPVVPQGA PPAGTLTLGG TGTVVGTASS SVSDFEAKVT LNSICSTTGP AADYSDPPVQ
GANYILNVTV ETVRGETTAS PDDFYIQTSD GSRYDGSFTT VEPKLFTLDL KAGQKVRGNV
VIDAPAGHHI LSWEPLFATQ PAKFQF