Gene Francci3_1387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1387 
Symbol 
ID3903368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1666755 
End bp1668266 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content73% 
IMG OID637878724 
Productamine oxidase 
Protein accessionYP_480493 
Protein GI86740093 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.600466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00651843 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGCACGG TGACCGGAGC GACGGATCAC GTCGTCATCG TCGGGGCCGG TCTCGCCGGT 
CTGTCGGTCG CGCTGCGCCT GGTCGGAGCC GGCCGGCGGG TGACGGTGCT CGAGCGGGAT
GCGACCCCCG GCGGTCGGGC CGGCCTGCTC GCACTGGGCG ACTATCGTTT CGACACCGGA
CCGACCGTCC TGACCATGCC CGAGCTGATT GCCGACGCCC TGGACTGTGT CGGGGAGGAT
CTCGACCGCT GGCTGTCACT GACCCGGCTC GACCCGATGT ACCGGGGGTT CTTCGCCGAC
GGTTCGTCGC TGGACATCCG CGCCGACCCC GCCGACACCG CACAGGAGAT CCGGGCGCTG
TGCGGCCCGG CGGAGGCGGC CGGCTTCGAG CGGTTCGTCG GAATCGTCAC TGCAATGTTC
CGCACCCAGC TGCGCCACTT CATCGACGCG CAGGTCGACT CCCCGCTGTC GTTGCTGCGC
CCGCAGCTCG CCCGGCTGGC CGCCCTCGGC GGCTTCCGCC GGCTCGACAC CGTCGTCGGC
CGCCATCTAC GGGACCCGCG GACCCGGCGG ATGTTCTCCT TCCAGGCGAT GTACGCCGGG
CTCGCCCCGC ACGACGCACT GGCCCTCTAT GCCGTCATCT CCTACATGGA CTGCGTCGCC
GGGGTCTACC ACCCGGCCGG CGGCATGCAC GCCGTCGCCG AGGCGCTGGC CGGCGCCGCG
GCCAAACACG GTGCCACCCT GCGCTACGAG ACCACGGTGA CGCGGGTGGA GGTGCGGGGC
GGCCGGGCGG TCGCGGTGCA CACCGCGGCG GGGGAGCGCA TCGGCGCCGA CGTGGTGGTG
CTCAACCCCG ACCTGCCGAT CGCCTACACC GAACTGCTGC CTCCCTCGAC GGCGCCGGCC
CGCCTGCGCC GCCTGCGGTA CTCGCCCTCG TGCTTCCTCC TGCTGGCTGG GGCGCGCGTG
GAGTACCCGC ACACCGCGCA TCACACCATC CACTTCGGCC GGGCCTGGCG GTCGACCTTC
CGCGAGATCA TCGATCGCGG CGAGCTGATG AGCGACCCGT CCTTCCTCGT CACCACCCCC
TCACGCACCG AACCCGCGAT GGCCTCGGCC GGCGGCCACA CGTACTACGT GCTCTTCCCC
ACACCCAACC TCACCGCCCC GCTCGACTGG TCGGTGCTGC GCTCGCGCTA CCGCGACGAG
GTGGTCGCCA CCTGCGAACG CCACGGCTAC CGAGGCTTCG GCGACGCCAT CGAGGTGGAG
CAGGTCACCA CGCCCGCCGA CTGGCGGGCC CGCGGGATGG CGGCGGGCGC CCCGTTCGCG
GCGGCCCACA CCTTTCGCCA GACCGGTCCG TTCCGCCCCT CGAACCTGGC CCCGGGCCTC
GCGAACGTCG TGTTCGCCGG CAGCGGCACC CGGCCCGGCG TCGGGGTCCC GATGGTCCTC
ATTTCCGGCC GGCTGGCCGC GGAGCGGATC CTCGGCGCTG ACCGGGGTTA CCGTTCCCGC
GCCCTGTGCT GA
 
Protein sequence
MRTVTGATDH VVIVGAGLAG LSVALRLVGA GRRVTVLERD ATPGGRAGLL ALGDYRFDTG 
PTVLTMPELI ADALDCVGED LDRWLSLTRL DPMYRGFFAD GSSLDIRADP ADTAQEIRAL
CGPAEAAGFE RFVGIVTAMF RTQLRHFIDA QVDSPLSLLR PQLARLAALG GFRRLDTVVG
RHLRDPRTRR MFSFQAMYAG LAPHDALALY AVISYMDCVA GVYHPAGGMH AVAEALAGAA
AKHGATLRYE TTVTRVEVRG GRAVAVHTAA GERIGADVVV LNPDLPIAYT ELLPPSTAPA
RLRRLRYSPS CFLLLAGARV EYPHTAHHTI HFGRAWRSTF REIIDRGELM SDPSFLVTTP
SRTEPAMASA GGHTYYVLFP TPNLTAPLDW SVLRSRYRDE VVATCERHGY RGFGDAIEVE
QVTTPADWRA RGMAAGAPFA AAHTFRQTGP FRPSNLAPGL ANVVFAGSGT RPGVGVPMVL
ISGRLAAERI LGADRGYRSR ALC