Gene Franean1_4481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4481 
Symbol 
ID5672831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5346623 
End bp5348143 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content64% 
IMG OID641243348 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001508764 
Protein GI158316256 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATA GCACCAGGAG TCCCGCATCG TTAGGTCGGA CGTTGCTTGC GATAGCTGCT 
GACCGGCCAG CTCACCCGGC GGTAACCACC CGGTCGGGGT CGACTTCCTA CCGGGAGCTG
GCGGACGGTG CGCTGCGCGT CGCGGCAGCA CTGACGGTGC AGGGGCTCGG TCTTGGCGAC
CGAATAGCGA TCCTGGCTCG TAACGATCTA CCCTACGTCG AGCTCATCTA CGGCGCGGAC
TTTGTCGGTG CGGTGGTCGT CGGGATCAAT TGTCGGCTCT CGCCGGCGGA GGTTGCCGAT
ATTCTCGACG ACTGCCAGCC CAGCCTTGTG TTCGTCGCGG ACGAGTACCT GCCGCTGCTG
GGTTCCGCCG CCGCGGGCGT CCTCCGGGTG TCTCTTGATC GCGACTACCG GACATGGTGC
GGCACCGGGG ACATGACGCG GTTCGTGCCG CGGGTCGGTT ACGCCGACAG TGTGGTACTC
ATGGTCTACA CCAGCGGCAC CACCGGTCGG TCGAAAGGTG TGCGACTGAC CGAGGCCAAC
ATCACGGCCG CCCTCGCCGC GAACAGAGAT GTCTGGTTCG TTGGCCCGGA GATCCGGGCG
CTGGCGCTTT TTCCGCTGTT TAATATCAGC GGTTCGATCT TTCTGCTCTC GATCTTACAT
GTCGGCGGTG AAGTCGTCAT TGCCGAGAAC GCGTCAGGCG CCACCATCCT GGAACTTCTC
GGGGCGAGGC GCATTACCCA CGCGCTGTTC GTGGCGGCGA TGATCGTCGC GCTGCTTGAT
CAACCGGCCG ACGACGAGAT CGACCTGTCC AGCCTGCGAG TACTGATCTA TGGTGCCGCT
CCGTCTTCGG CGGCCGTGAT CGACCGGGCT ATGCGGCGGC TGCCGACCTG TGATTTCTTT
CAGGGATACG GGATGACGGA GACCTGTGGC GGCATCGCGA TGACGCCGCC GCATCGATAC
GGCGAAGAGA TCGCGCCGGC ATCGGTGGGG CGAGCCATAC CATCCTATGA GATTCGGATT
GTGGATCCGG TCAGGCGCAC CGACCTGCCG GTTGGTGTGG AAGGCGAGAT CTGGGCGCGC
GGGCCACAGA ACACCATCGG CTACTGGAAC CGGGCCGAGG AGACCGATCG TCTGCTCGCC
GCGGACGGGT GGCTTCGTAC CGGTGATGTC GGTGTCCTAG ACGCCGCTCA CAACCTCTAT
GTCGTAGACC GCCTCAAAGA CATGATTATT TCGGGTGGGT TCAACGTCTA TTCGCTCGAG
GTCGAGCAGA TCCTGGTCGG CCACCCGGAT GTCGGTGATG CGGCGGTGTT CGGCGTGCCC
GACGAGCGTT GGGGCGAGAC CGTGGTGGCC GTGGTGACCC TGCGTCCGGG CGCCACCTGC
GTCCCGGCCG ACCTGAGTGA GTTCGCCCGG GCGCGGTTGG CCCACTTCAA ATGCCCGCGG
CGGATCGAGA TTCTCGACGA ACTTCCGAGG AATGCGGCCG GAAAGATCCT CAAAAGAGAG
CTTCGCGGCC GGTTCAGCTG A
 
Protein sequence
MTNSTRSPAS LGRTLLAIAA DRPAHPAVTT RSGSTSYREL ADGALRVAAA LTVQGLGLGD 
RIAILARNDL PYVELIYGAD FVGAVVVGIN CRLSPAEVAD ILDDCQPSLV FVADEYLPLL
GSAAAGVLRV SLDRDYRTWC GTGDMTRFVP RVGYADSVVL MVYTSGTTGR SKGVRLTEAN
ITAALAANRD VWFVGPEIRA LALFPLFNIS GSIFLLSILH VGGEVVIAEN ASGATILELL
GARRITHALF VAAMIVALLD QPADDEIDLS SLRVLIYGAA PSSAAVIDRA MRRLPTCDFF
QGYGMTETCG GIAMTPPHRY GEEIAPASVG RAIPSYEIRI VDPVRRTDLP VGVEGEIWAR
GPQNTIGYWN RAEETDRLLA ADGWLRTGDV GVLDAAHNLY VVDRLKDMII SGGFNVYSLE
VEQILVGHPD VGDAAVFGVP DERWGETVVA VVTLRPGATC VPADLSEFAR ARLAHFKCPR
RIEILDELPR NAAGKILKRE LRGRFS