Gene Franean1_4117 details

Gene Information

Locus tag: Franean1_4117
Symbol:
ID: 5672475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4900559
End bp: 4902229
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 72%
IMG OID: 641242993
Product: TAP domain-containing protein
Protein accession: YP_001508410
Protein GI: 158315902
COG category:
COG ID:
TIGRFAM ID:
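
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length_bp(4900559, 4902229)
print(length_bp, protein_length_aa(length_bp))  # 1671 556
```

The printed values match the Gene Length and Protein Length fields.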


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGAG ATGAGAAACG CACGCGGATG TGCGGTGTCG TTGTCGGCGC CGCGGCCCTC 
GTCGGGGTCG GCTCCTTCAC CGGGCCGATC CCCGGGGCGC TGGCGGCGGA CGGCGCGCCC
GATCCCGCCC CGCCCGGTGT GGCGAGCCTG AACCCGGCCC TCGCCCCGTT CGAGAACCAG
CCGGTGCGCT GGCACGAATG CCGGACCGGC CCCAACGACG CGATCGGCTC CTATCTGGAC
GCGGCCGGCG CGCAGTGCGC CGGGATCAGC GTCCCGCTGG ACTACGCCCG CCCCGACGGC
CGGGAGATCA CCCTCGGGGT GTCCCGGATC AGGGCAGCCG ACACCGCCCA CCGGCGCGGC
ATTCTGATGA TCAATATTGG CGGTCCCGGC GGCCCTGCTC TGGACGCCAC CCCGGACCTG
CGTGGGCTGC TCGGTGCGGC GGCGGACGGC TTCGACGTGG TCGGGATGGA CCCTCGTTTC
GTCGGCCGCA GCGCACCGCT GGACTGCGGG CCGATCCTGA AGCGGCCCTG GCCGCGCGCC
GGCGGCCCCG CCCAGGACAG TTTCGATCGC ACCGCCGACG GTCAGGCCGC GATAGCGCAG
GCGTGCGCGG TGCACGCCGA TGTGCTGCCG TTCGCGTCGA CGCGGAACAC CGCTCGTGAC
ATGGATGTTG TCCGTGCTGC GCTTGATGAG CGGAAGACGT CGTTTCTTGG CTTTTCCTAC
GGCACCTACC TGGGTGCGGT GTACATGCAG ATGTTCCCTG ACCGGGTCGA TCGTTTCGTG
CTCGACAGCG CGGTGGATCC GGCGACGTAC AACCCGCGGG TGCTGCGCGA CACCGGGGAT
CTGCTCGAGG GCGCGCTGCG GGAGTGGGCC GGCTGGGCCG CCGGGCGGGA CGCCCAGTTC
GGGCTTGGCA GCACGGCCGC GGAGGTGCTG GCCACCGTTG ACCGGATCTA CGCCGCGGCC
GTGCGCGGTC CGCTGACCGT GGACGGCGTG GAGTACGACG CCGGGGACAT CCCGGGGCTG
CTCATTGATG CACTCGTCGA CGACAGCGCT GAGGCGTCCG ACGTCCTGGC GATGGGTGTG
CGGGCCTTCG CTGACGCGGC GGATGGCCGT GCCCCGGCGG AGAACCCCTA CCTCGACGAG
TTCTTGGACG GTGTCGCTAC CGGCGGTCCG ACGCTGCCCG GCGGGTCACC CGGGCGCCGG
GTGGAGGCGG TGCCGGCGGG GTCGATCAGC GCGTTCCAGA GCGCGCAGCT CGCGGTCCTG
TGCGGGGATG TGCCGGCGTC CCGTGTGGCC GGTGAGTACC TGGCGGACAT CCGCCGTCAT
CAGCGGGCGC AGCCGCATGT GGGTGCGGCG ATCTGGAACC TGACCCCGTG CACGTTCTGG
CCGGTGCGCC CGGTGGAGGC ACCGACCCGG GTGGCGAATG CTGTGCCCGC GCTGGTGGTG
GCCGCGGAGA AGGACAACCG CACGCCGTAC GCGGGCAGCC GGGCGCTGCA CCGGGCGTTG
TCCTCGTCGC GACTGGTGAC GTTGCGCGGA GCACGGGTGC ACGGCGTGTA CGGCGTGCGC
AGCGGCTGTG TCGACGACGC AGTCAATGCC TACCTGCGGT CGGGCACCCT GCCCAGCGCC
GACCTCACCT GCACCCGTCC GCCGGCTCCC CCGGGTTCGC TTCCGGAGTG A
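
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the GC content field could be recomputed from such text as follows. The function name `gc_percent` is hypothetical, and the short input in the usage line is a toy string, not this gene.

```python
def gc_percent(sequence_text: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Pasting the full gene sequence block into `gc_percent` gives a value to compare with the GC content field.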
 
Protein sequence
MRRDEKRTRM CGVVVGAAAL VGVGSFTGPI PGALAADGAP DPAPPGVASL NPALAPFENQ 
PVRWHECRTG PNDAIGSYLD AAGAQCAGIS VPLDYARPDG REITLGVSRI RAADTAHRRG
ILMINIGGPG GPALDATPDL RGLLGAAADG FDVVGMDPRF VGRSAPLDCG PILKRPWPRA
GGPAQDSFDR TADGQAAIAQ ACAVHADVLP FASTRNTARD MDVVRAALDE RKTSFLGFSY
GTYLGAVYMQ MFPDRVDRFV LDSAVDPATY NPRVLRDTGD LLEGALREWA GWAAGRDAQF
GLGSTAAEVL ATVDRIYAAA VRGPLTVDGV EYDAGDIPGL LIDALVDDSA EASDVLAMGV
RAFADAADGR APAENPYLDE FLDGVATGGP TLPGGSPGRR VEAVPAGSIS AFQSAQLAVL
CGDVPASRVA GEYLADIRRH QRAQPHVGAA IWNLTPCTFW PVRPVEAPTR VANAVPALVV
AAEKDNRTPY AGSRALHRAL SSSRLVTLRG ARVHGVYGVR SGCVDDAVNA YLRSGTLPSA
DLTCTRPPAP PGSLPE