Gene Franean1_5739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5739 
Symbol 
ID5674065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6973619 
End bp6974722 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content66% 
IMG OID641244592 
ProductNLP/P60 protein 
Protein accessionYP_001509995 
Protein GI158317487 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCTG CGCAGAGTGC GCAGCTTGAT GCAGAAAGAC GCCGACAGGA AGGGCGCGGT 
CGCCACCGGG CTCCGTCCGT CCCCACAACG TCGAGCCGGG CCAGAGCGCG GGCCCGCGCG
GTCGCAGCCG TGACCACTGG CACAGTCGTG GTCTCCGGTA TGGCTCTCGC CGGATGTGCC
CCGGAGCCCA GCTCGGACGG CGCGTTGGAC GACAGCACAA GCACCACGTC ACTGACGCTT
GCCACCCAGA TCGGGTCCCG GCCGGCGACC GACGGCGCCA TCCAGGCCGC CGCCGCGGCG
GACGGCACGC CCGGTACCGT CCTTGACGCC ACGACGGACA TCTCTGCACC GACTCTGTCG
TCCAAAATCG ACGTGGGTCT GCGCGTGACG AACCCCGACG TGACAGTCAA CGCGGACGAG
CCTGTCAACA TCGGCTTCTC GCTCTTCAAC GAGGAGACCC ACGCCCCGCT GGCGGACCAG
CTCATCAAGG TGCAGGTCAA ACTACCCACC GGCTGGGCGA CCTTCCTGCA CCTGACCACC
GACGAGCATG GCATCGCCTC CTACACGGCG CGTGTCCTCA CCACCACGAA CGTCACGGTG
ATCTTCGATG GAACGGACGC CCTGCAATCC GCCCGCTCCG AGAACGAAGC GACCCTGCGC
GTGCGCCCAG CCGCGCCACC GGCGTCCATC AGGGCCTCCC GCGGCACAGT GAATGCTGAA
ACTCCGACGA TCGGCATCGA TGTCCCGGCC AACACGCTCG GGGAAAAAGC CGTATACCTG
GCGTCCCTGC AAGCCGGTAA GCCCTACGTT TACGGGGCCA CCGGACCGTA CAGCTTCGAC
TGCTCCGGTT TGGTGCAATA CATCTACAAG CAGCTCGGCA AGACACTTCC CCGTACCACC
GACCAGCAGT ACGCGGCGAC AACCAGGGTC GCCCAGGGCT CCGAACAGCC AGGCGATCTC
ATCTTCTTCG GCCAGCCTGG CGCGATCTAT CATATGGGGA TCTACGCCGG TGGCGGGAAA
ATGTGGGTCG CGCCGAAGAC CGGTGACGTG GTGAAGCTCC AGACCATCTG GGCAGACTCC
TACTCGGTCG GTCGGGTGAC CTGA
 
Protein sequence
MLSAQSAQLD AERRRQEGRG RHRAPSVPTT SSRARARARA VAAVTTGTVV VSGMALAGCA 
PEPSSDGALD DSTSTTSLTL ATQIGSRPAT DGAIQAAAAA DGTPGTVLDA TTDISAPTLS
SKIDVGLRVT NPDVTVNADE PVNIGFSLFN EETHAPLADQ LIKVQVKLPT GWATFLHLTT
DEHGIASYTA RVLTTTNVTV IFDGTDALQS ARSENEATLR VRPAAPPASI RASRGTVNAE
TPTIGIDVPA NTLGEKAVYL ASLQAGKPYV YGATGPYSFD CSGLVQYIYK QLGKTLPRTT
DQQYAATTRV AQGSEQPGDL IFFGQPGAIY HMGIYAGGGK MWVAPKTGDV VKLQTIWADS
YSVGRVT